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HUMAN POTASSIUM CHANNEL GENES 

Introduction 

Background 

5 Ion channels are multi-subunit, membrane bound proteins critical for 

maintenance of cellular homeostasis in nearly all cell types. Channels are 
involved in a myriad of processes including modulation of action potentials, 
regulation of cardiac myocyte excitability, heart rate, vascular tone, neuronal 
signaling, activation and proliferation of T-cells, and insulin secretion from 
10 pancreatic islet cells. In humans, ion channels comprise extended gene families 
with hundreds, or perhaps thousands, of both closely related and highly divergent 
family members. The majority of known channels regulate the passage of sodium 
(Na + ), chloride (Cl~), calcium (Ca~) and potassium (K*) ions across the cellular 
membrane. 

15 Given their importance in maintaining normal cellular physiology, it is not 

surprising that ion channels have been shown to play a role in heritable human 
disease. Indeed, ion channel defects are involved in predisposition to epilepsy, 
cardiac arrhythmia (long QT syndrome), hypertension (Bartter's syndrome), cystic 
fibrosis, (defects in the CFTR chloride channel), several skeletal muscle disorders 

20 (hyperkalemic periodic paralysis, paramyotonia congenita, episodic ataxia) and 
congenital neural deafness (Jervell-Lange-Nielson syndrome), amongst others. 

The potassium channel gene family is believed to be the largest and most 
diverse ion channel family. K* channels have critical roles in multiple cell types 
andpathways, and are the focus of significant investigation. Four human 

25 conditions, episodic ataxia with myokymia, long QT syndrome, epilepsy and 
Bartter's syndrome have been shown to be caused by defective K + ion channels. 
As the K* channel family is very diverse, and given that these proteins are critical 
components of virtually all cells, it is likely that abnormal K* channels will be 
involved in the etiology of additional renal, cardiovascular and central nervous 

30 system disorders of interest to the medical and pharmaceutical community. 

The K + channel superfamily can be broadly classified into groups, based 
upon the number of transmembrane domain (TMD) segments in the mature 

1 
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protein. The minK (IsK) gene contains a single TMD, and although not a channel 
by itself, minK associates with different 10 channel subunits, such as KvLQTI and 
HERG to modify the activity of these channels. The inward rectifying K+ channels 
(GIRK, IRK, CIR, ROMK) contain 2 TMD domains and a highly conserved pore 
domain. Twik-1 is a member of the newly emerging 4TMD K* channel subset. 
Twik-like channels (leak channels) are involved in maintaining the steady-state K* 
potentials across membranes and therefore the resting potential of the cell near 
the equilibrium potential for potassium (Duprat et al. (1997) EMBO J 16(17):5464- 
5471). These proteins are particularly intriguing targets for therapeutic regulation. 
The 6TMD, or Shaker-like channels, presently comprise the largest subset of 
known K* channels. The slopoke (slo) related channels, or Ca~ regulated 
channels apparently have either 10 TMD, or 6 TMD with 4 additional hydrophobic 
domains. 

Four transmembrane domain, tandem pore domain K+ channels (4T/2P 
channels) represent a new family of potassium selective ion channels involved in 
the control of background membrane conductances. In mammals, five channels 
fitting the 4T/2P architecture have been described: TWIK, TREK, TASK-1, TASK- 
2 and TRAAK. The 4T/2P channels all have distinct characteristics, but are all 
thought to be involved in maintaining the steady-state K* potentials across 
membranes and therefore the resting potential of the cell near the equilibrium 
potential for potassium (Duprat et al. (1997) EMBO J 16(17):5464-5471). These 
proteins are particularly intriguing targets for therapeutic regulation. Within this 
group, TWIK-1, TREK-1 and TASK-1 and TASK-2 are widely distributed in many 
different tissues, while TRAAK is present exclusively in brain, spinal cord and 
retina. The 4T/2P channels have different physiologic properties; TREK-1 
channels, are outwardly rectifying (Fink et al. (1996) EMBO J 15(24):6854-62) f 
while TWIK-1 channels, are inwardly rectifying (Lesage et al. (1996) EMBO J 
15(5): 1004-11. TASK channels are regulated by changes in PH while TRAAK 
channels are stimulated by arachidonic acid (Reyes et al. (1998) JBC 
273(47):30863-30869). 

The degree of sequence homology between different K* channel gen s is 
substantial. At the amino acid level, there is about 40% similarity between 
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different human genes, with distinct regions having higher homology, specifically 
the pore domain. It has been estimated that the K+ channel gene family contains 
approximately 10M0 3 individual genes. Despite the large number of potential 
genes, an analysis of public sequence databases and the scientific literature 
5 demonstrates that only a small number, approximately 20-30, have been 
identified. This analysis suggests that many of these important genes remain to 
be identified. 

Potassium channels are involved in multiple different processes and are 
important regulators of homeostasis in nearly all cell types. Their relevance to 

10 basic cellular physiology and role in many human diseases suggests that 
pharmacological agents could be designed to specific channel subtypes and 
these compounds then applied to a large market (Bulman, D.E. (1997) Hum Mol 
Genet 6:1679-1685; Ackerman, M.J. and Clapham D.E. (1997) NEJM 336:1575- 
1586, Curran, M.E. (1998) Current Opinion in Biotechnology 9:565-572). The 

15 variety of therapeutic agents that modulate K+ channel activity reflects the 
diversity of physiological roles and importance of K+ channels in cellular function. 
A difficulty encountered in therapeutic use of therapeutic agents that modify K+ 
channel activity is that the presently available compounds tend to be non-specific 
and elicit both positive and negative responses, thereby reducing clinical efficacy. 

20 To facilitate development of specific compounds it is desirable to have further 
characterize novel K+ channels for use in in vitro and in vivo assays. 

Relevant Literature 

A large body of literature exists in the general area of potassium channels. 

25 A review of the literature may be found in the series of books, "The Ion Channel 
Factsbook", volumes 1-4, by Edward C. Conley and William J. Brammar, 
Academic Press. An overview is provided of: extracellular ligand-gated ion 
channels (ISBN: 0121844501), intracellular ligand-gated channels (ISBN: 
012184451X), Inward rectifier and intercellular channels (ISBN: 0121844528), 

30 and voltage gated channels (ISBN: 0121844536). Hille, B. (1992) "Ionic 
Channels of Excitabl Membranes", 2 nd Ed. Sunderland MA:Sinauer Associates, 
also reviews potassium channels. 

3 
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Jan and Jan (1997) Annu. Rev. Neurosci . 20:91-123 review cloned 
potassium channels from eukaryotes and prokaryotes. Ackerman and Clapham 
(1997) N. Engl. J. Med . 336:1575-1586 discuss the basic science of ion channels 
in connection with clinical disease. Bulman (1997) Hum. Mol. Genet . 6:1679- 
5 1685 describe some phenotypic variation in ion channel disorders. 

Stephan et a/. (1994) Neurology 44:1915-1920 describe a pedigree 
segregating a myotonia with muscular hypertrophy and hyperirritability as an 
autosomal dominant trait (rippling muscle disease, Ricker et al. (1989) Arch. 
Neurol . 46405-408). Electromyography demonstrated that mechanical stimulation 

10 provoked electrically silent contractions. The responsible gene was localized to 
the distal end of the long arm of chromosome 1 , in a 12-cM region near D1S235. 

Type II pseudohypoaldosteronism is the designation used for a syndrome 
of chronic mineralocorticoid-resistant hyperkalemia with hypertension. The 
primary abnormality in type II PHA is thought to be a specific defect of the renal 

15 secretory mechanism for potassium, which limits the kaliuretic response to, but 
not the sodium and chloride reabsorptive effect of, mineralocorticoid. By analysis 
of linkage in families with autosomal dominant transmission, Mansfield et al. 
(1997) Nature Genet. 16:202-205 demonstrated locus heterogeneity of the trait, 
with linkage of the PHA2 gene to 1q31-q42 and 17p11-q21. 

20 Sequences of four transmembrane, two pore potassium channels have 

been previously described. Reyes et al. (1998) J Biol Chem 273(47):30863- 
30869 discloses a pH sensitive channel. As with the related TASK-1 and TRAAK 
channels, the outward rectification is lost at high external K+ concentration. The 
TRAAK channel is described by Fink ef al. (1998) EMBO J 17(12):3297-308. A 

25 cardiac two-pore channel is described in Kim et al. (1998) Circ Res 82(4):513-8. 
An open rectifier potassium channel with two pore domains in tandem and having 
a postsynaptic density protein binding sequence at the C terminal was cloned by 
Leonoudakis ef al. (1998) J Neurosci 18(3):868-77. 

The electrophysiological properties of Task channels are of interest, 

30 (Duprat et al. (1997) EMBO J 16:5464-71). TASK currents are K+-selective, 
instantaneous and non-inactivating. They show an outward rectification when 
xtemal [K+] is low, which is not observed for high [K+]out suggesting a lack of 

4 
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intrinsic voltage sensitivity. The absence of activation and inactivation kinetics as 
well as voltage independence are characteristic of conductances referred to as 
leak or background conductances. TASK is very sensitive to variations of 
extracellular pH in a narrow physiological range, a property probably essential for 
5 its physiological function, and suggests that small pH variations may serve a 
communication role in the nervous system. 

Summary of the Invention 
Isolated nucleotide compositions and sequences are provided for K+Hnov 
10 genes. The K+Hnov nucleic acid compositions find use in identifying homologous 
or related genes; in producing compositions that modulate the expression or 
function of its encoded proteins; for gene therapy; mapping functional regions of 
the proteins; and in studying associated physiological pathways. In addition, 
modulation of the gene activity in vivo is used for prophylactic and therapeutic 
15 purposes, such as treatment of potassium channel defects, identification of cell 
type based on expression, and the like. 

Description of the Specific Embodiments 
Nucleic acid compositions encoding K+Hnov polypeptides are provided. 

20 They are used in identifying homologous or related genes; in producing 
compositions that modulate the expression or function of the encoded proteins; 
for gene therapy; mapping functional regions of the proteins; and in studying 
associated physiological pathways. The K+Hnov gene products are members of 
the potassium channel gene family, and have high degrees of homology to known 

25 potassium channels. The encoded polypeptides may be alpha subunits, which 
form the functional channel, or accessory subunits that act to modulate the 
channel activity. 

Characterization of K+Hnov t 
30 The sequence data predict that the provided K+Hnov genes encode 

potassium channels. Table 1 summarizes the DNA sequences, corresponding 
SEQ ID IMOs, chromosomal locations, and polymorphisms. The provided 

5 
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sequences may encode a predicted ICchannel, e.g. voltage gated, inward 
rectifier, efc.; or a modulatory subunit. 

Electrophysiologic characterization of ion channels is an important part of 
understanding channel function. Full length ion channel cDNAs may be 
combined with proper vectors to form expression constructs of each individual 
channel. Functional analyses of expressed channels can be performed in 
heterologous systems, or by expression in mammalian cell lines. For expression 
analyses in heterologous systems such as Xenopus oocytes, synthetic mRNA is 
made through in vitro transcription of each channel construct. mRNA is then 
injected, singly or in combination with interacting channel subunit mRNAs, into 
prepared oocytes and the cells allowed to express the channel for several days. 
Oocytes expressing the channel of interest are then analyzed by whole cell 
voltage clamp and patch clamp techniques. 

To determine the properties of each channel when expressed in 
mammalian cells expression vectors specific to this type of analyses may be 
constructed and the resultant construct used to transform the target cells (for 
example human embryonic kidney (HEK) cells). Both stable and transiently 
expressing lines may be studied using whole cell voltage clamp and patch clamp 
techniques. Data obtained from EP studies includes, but is not limited to: current 
profiles elicited by depolarization and hyperpolarization, current-voltage (l-V) 
relationships, voltage dependence of activation, biophysical kinetics of channel 
activation, deactivation, and inactivation, reversal potential, ion selectivity, gating 
properties and sensitivity to channel antagonists and agonists. 

Heterologous or mammalian cell lines expressing the novel channels can 
be used to characterize small molecules and drugs which interact with the 
channel. The same experiments can be used to assay for novel compounds 
which interact with the expressed channels. 

In many cases the functional ion channel formed by K+Hnov polypeptides 
will be heteromultimers. Heteromultimers are known to form between different 
voltage gated, outward rectifying potassium channel a subunits, generally 
comprising four subunits, and frequently associated with auxiliary, p subunits. 
Typically such a subunits share a six-transmembrane domain structure (S1-S6), 

6 
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with one highly positively charged domain (S4) and a pore region situated 
between S5 and S6. Examples of such subunits are K+Hnov4, K+Hnov9, and 
K+Hnov12. Channels are also formed by mutimerization of subunits of the two 
transmembrane and one pore architecture. It is predicted that two subunits of 
5 K+Hnov49 or K+Hnov59 will be required to form a functional channel. 

Heteromultimers of greatest interest are those that form between subunits 
expressed in the same tissues, and are a combination of subunits from the same 
species. In addition, the formation of multimers between the subject polypeptides 
and subunits that form functional channels are of particular interest. The resulting 

10 channel may have decreased or increased conductance relative to a 
homomultimer, and may be altered in response to beta subunits or other 
modulatory molecules. 

Known voltage gated K+ channel a subunits include Kv1.1-1.8 (Gutman et 
a/. (1993) Sem. Neurosci. 5:101-106); Kv2.1-2.2; Kv3.1-3.4; Kv4.1-4.3; Kv5.1; 

15 Kv6.1; Kv7.1; Kv8.1; Kv9.1-9.2. The subunits capable of forming ion inducing 
channels include all of those in the Kv1 through Kv4; and Kv7 families. The 
Kv5.1, Kv6.1 t Kv8.1 and Kv9.1-9.2 subunits may be electrically silent, but 
functional in modifying the properties in heteromultimers. 



7 



WO 99/43696 



PCT/US99/03826 



Q. 



© 

c 
c 

£ 



c 



© 
> 



CL 



o 

I 

<D 

2 
5 



© 
C 
C 
(0 

+ 

ra 
© 



O 

E 



a 



O 

"O 

s g 

E » 

© + 
E* 

s I 



o 

(A 

ii 

C (A 
^ O 
O CL 



CO 



CM 
Q. 



CO 



3 



-C 



o 

UJ 

CO 
c 

i 

a 



O 

LU 

CO 



© 

E 

co 



CO 
CO 
CM 



3 



o 

CL 

© 

> 

S3 
CO 

Eio 

§8 

< CM 



CM 
6 

z 
9 
a 

Ui 

CO 



O 
z 
g 

a 

LU 

(0 



> 

o 
c 

I 



5 



o o 

cm m 

5 P 



o 
o 

O o O o 
N * 2t 5 

§ p i s 



3 CD 
1 I 



CD 



3 

If 



K O H 

5 R s 
8 5 3 



« o 

E 

0) CO 



CO 



CO 

d 
z 



o 

Ui 
CO 



6 

z 
o 

2 

CO 



?! 
o 
c 

I 
+ 



8 



WO 99/43696 



PCT/US99/03826 



Human ortholog of murine gene, 6 
transmembrane dominas, voltage 
gated, delayed rectifier K+ channel 


6 transmembrane domain, voltage 
gated K+ channel 


Modulatory subunit 


Homology to K+ channel protein of 
C. elegans 


c 

3 

-Q 
3 

CO 

2 

<D 
-Q 


4T/2P channel; linked to the 
disease loci for rippling muscle 
disease 1 (RMD1), and type II 
pseudohypoaldosteronism 


4T/2P channel 


















CO 




CO 


N/A 






12q- 




CO 
CM 

cr 

CO 


cr 

GO 




CL 
CM 
CM 




o 




















co « 














5' splices 


*60A; T2496A 






(ATCT) n repeats in the 
UTR sequence, starting 
position 2186 




N/A 






C3168T 




4 alternative 


G1162A; T1< 




V/N 


N/A 


SEQID NO: 18 


SEQ ID NO:20 


SEQ ID NO:25 


SEQ ID NO:27 


SEQ ID NO:30 


SEQ ID NO:81 


SEQ ID NO:83 


IDNO:17 






IDNO:19 




ID NO:21-24 


ID NO:26 




CO 

<y 

GO 
CM 

d 
z 

D 


ID NO:80 


d 
z 

Q 


SEQ 






SEQ 




SEQ 


SEQ 




SEQ 


SEQ 


SEQ I 


1 

X 

i 


K+Hnov 14 


oo 

% 

c 
X 
♦ 


CM 

o 

c i 
X 

+ 'I 


1 

o 
c 
X 

£ 


CO 

s 

c 
X 
k 


CO 

3 

C 

X 

k 



9 



WO 99/43696 



PCT/US99/03826 



K+Hnov nucleic acid compositions 
As used herein, the term a K+Hnov tt is generically used to refer to any one 
of the provided genetic sequences listed in Table 1. Where a specific K+Hnov 
sequence is intended, the numerical designation, e.g. K49 or K59, will be added. 

5 Nucleic acids encoding K+Hnov potassium channels may be cDNA or genomic 
DNA or a fragment thereof. The term u K+Hnov gene" shall be intended to mean 
the open reading frame encoding any of the provided K+Hnov polypeptides, 
introns, as well as adjacent 5' and 3' non-coding nucleotide sequences involved in 
the regulation of expression, up to about 20 kb beyond the coding region, but 

10 possibly further in either direction. The gene may be introduced into an 
appropriate vector for extrachromosomal maintenance or for integration into a 
host genome. 

The term "cDNA" as used herein is intended to include all nucleic acids 
that share the arrangement of sequence elements found in native mature mRNA 

15 species, where sequence elements are exons and 3* and 5* non-coding regions. 
Normally mRNA species have contiguous exons, with the intervening^introns, 
when present, removed by nuclear RNA splicing, to create a continuous open 
reading frame encoding a K+Hnov protein. 

A genomic sequence of interest comprises the nucleic acid present 

20 between the initiation codon and the stop codon, as defined in the listed 
sequences, including all of the introns that are normally present in a native 
chromosome. It may further include the 3* and 5' untranslated regions found in 
the mature mRNA. It may further include specific transcriptional and translational 
regulatory sequences, such as promoters, enhancers, etc., including about 1 kb, 

25 but possibly more, of flanking genomic DNA at either the 5' or 3' end of the 
transcribed region. The genomic DNA may be isolated as a fragment of 100 kbp 
or smaller, and substantially free of flanking chromosomal sequence. The 
genomic DNA flanking the coding region, either 3' or 5\ or internal regulatory 
sequences as sometimes found in introns, contains sequences required for 

30 proper tissue and stage specific expression. 
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The sequence of the 5' flanking region may be utilized for promoter 
elements, including enhancer binding sites, that provide for developmental 
regulation in tissues where K+Hnov genes are expressed. The tissue specific 
expression is useful for determining the pattern of expression, and for providing 
5 promoters that mimic the native pattern of expression. Naturally occurring 
polymorphisms in the promoter regions are useful for determining natural 
variations in expression, particularly those that may be associated with disease. 

Alternatively, mutations may be introduced into the promoter regions to 
determine the effect of altering expression in experimentally defined systems. 

10 Methods for the identification of specific DNA motifs involved in the binding of 
transcriptional factors are known in the art, e.g. sequence similarity to known 
binding motifs, gel retardation studies, ete. For examples, see Blackwell et a/. 
(1995) Mol Med 1: 194-205; Mortlock ef a/. (1996) Genome Res. 6: 327-33; and 
Joulin and Richard-Foy (1995) Eur J Biochem 232: 620-626. 

15 The regulatory sequences may be used to identify cis acting sequences 

required for transcriptional or translational regulation of K+Hnov expression, 
especially in different tissues or stages of development, and to identify cis acting 
sequences and trans acting factors that regulate or mediate K+Hnov expression. 
Such transcription or translational control regions may be operably linked to a 

20 K+Hnov gene in order to promote expression of wild type or altered K+Hnov or 
other proteins of interest in cultured cells, or in embryonic, fetal or adult tissues, 
and for gene therapy. 

The nucleic acid compositions of the subject invention may encode all or a 
part of the subject polypeptides. Double or single stranded fragments may be 

25 obtained of the DNA sequence by chemically synthesizing oligonucleotides in 
accordance with conventional methods, by restriction enzyme digestion, by PCR 
amplification, ete. For the most part, DNA fragments will be of at least 15 nt, 
usually at least 18 nt or 25 nt, and may be at least about 50 nt Such small DNA 
fragments are useful as primers for PCR, hybridization screening probes, efc. 

30 Larger DNA fragments, i.e. greater than 100 nt are useful for production of the 
ncoded polypeptide. For use in amplification reactions, such as PCR, a pair of 
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primers will b used. The exact composition of the primer sequences is not 
critical to the invention, but for most applications the primers will hybridize to the 
subject sequence under stringent conditions, as known in the art. It is preferable 
to choose a pair of primers that will generate an amplification product of at least 
5 about 50 nt, preferably at least about 100 nt. Algorithms for the selection of 
primer sequences are generally known, and are available in commercial software 
packages. Amplification primers hybridize to complementary strands of DNA, and 
will prime towards each other. 

The K+Hnov genes are isolated and obtained in substantial purity, 

10 generally as other than an intact chromosome. Usually, the DNA will be obtained 
substantially free of other nucleic add sequences that do not include a K+Hnov 
sequence or fragment thereof, generally being at least about 50%, usually at least 
about 90% pure and are typically "recombinant", /.e. flanked by one or more 
nucleotides with which it is not normally associated on a naturally occurring 

15 chromosome. 

The DNA may also be used to identify expression of the gene in a 
biological specimen. The manner in which one probes cells for the presence of 
particular nucleotide sequences, as genomic DNA or RNA, is well established in 
the literature and does not require elaboration here. DNA or mRNA is isolated 

20 from a cell sample. The mRNA may be amplified by RT-PCR, using reverse 
transcriptase to form a complementary DNA strand, followed by polymerase chain 
reaction amplification using primers specific for the subject DNA sequences. 
Alternatively, the mRNA sample is separated by gel electrophoresis, transferred 
to a suitable support, e.g. nitrocellulose, nylon, efc., and then probed with a 

25 fragment of the subject DNA as a probe. Other techniques, such as 
oligonucleotide ligation assays, in situ hybridizations, and hybridization to DNA 
probes arrayed on a solid chip may also find use. Detection of mRNA hybridizing 
to the subject sequence is indicative of K+Hnov gene expression in the sample. 
The sequence of a K+Hnov gene, including flanking promoter regions and 

30 coding regions, may be mutated in various ways known in the art to gen rate 
targ ted changes in promoter strength, sequence of the encoded prot in, efc. 
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The DNA sequence or protein product of such a mutation will usually be 
substantially similar to the sequences provided herein, i.e. will differ by at least 
one nucleotide or amino acid, respectively, and may differ by at least two but not 
more than about ten nucleotides or amino acids. The sequence changes may be 
5 substitutions, insertions or deletions. Deletions may further include larger 
changes, such as deletions of a domain or exon. Other modifications of interest 
include epitope tagging, e.g. with the FLAG system, HA, efc. For studies of 
subcellular localization, fusion proteins with green fluorescent proteins (GFP) may 
be used. 

10 Techniques for in vitro mutagenesis of cloned genes are known. Examples 

of protocols for site specific mutagenesis may be found in Gustin ef a/., 
Biotechniques 14:22 (1993); Barany, Gene 37:111-23 (1985); Colicelli et a/., Mol 
Gen Genet 199:537-9 (1985); and Prentki et a/., Gene 29:303-13 (1984). 
Methods for site specific mutagenesis can be found in Sambrook et a/., Molecular 

15 Cloning: A Laboratory Manual, CSH Press 1989, pp. 15.3-15.108; Weiner et a/., 
Gene 126:35-41 (1993); Sayers etal. % Biotechniques 13:592-6 (1992); Jones and 
Winistorfer, Biotechniques 12:528-30 (1992); Barton et a/., Nucleic Acids Res 
18:7349-55 (1990); Marotti and Tomich, Gene Anal Tech 6:67-70 (1989); and 
Zhu, AnalBiochem 177:120-4 (1989). Such mutated genes may be used to study 

20 structure-function relationships of K+Hnov, or to alter properties of the protein that 
affect its function or regulation. 

Homologs and orthologs of K+Hnov genes are identified by any of a 
number of methods. A fragment of the provided cDNA may be used as a 
hybridization probe against a cDNA library from the target organism of interest, 

25 where low stringency conditions are used. The probe may be a large fragment, or 
one or more short degenerate primers. Nucleic acids having sequence similarity 
are detected by hybridization under low stringency conditions, for example, at 
50°C and 6XSSC (0.9 M sodium chloride/0.09 M sodium citrate) and remain 
bound when subjected to washing at 55°C in 1XSSC (0.15 M sodium 

30 chloride/0.015 M sodium citrate). Sequence identity may be determined by 
hybridization under stringent conditions, for example, at 50°C or higher and 
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0.1XSSC (15 mM sodium chloride/01.5 mM sodium citrate). Nucleic acids having 
a region of substantial identity to the provided K+Hnov sequences, e.g. allelic 
variants, genetically altered versions of the gene, efc., bind to the provided 
K+Hnov sequences under stringent hybridization conditions. By using probes, 

5 particularly labeled probes of DNA sequences, one can isolate homologous or 
related genes. The source of homologous genes may be any species, e.g. 
primate species, particularly human; rodents, such as rats and mice, canines, 
felines, bovines, ovines, equines, yeast, nematodes, efc. 

Between mammalian species, e.g. human and mouse, homologs have 

10 substantial sequence similarity, i.e. at least 75% sequence identity between 
nucleotide sequences, in some cases 80 or 90% sequence identity, and may be 
as high as 95% sequence identity between closely related species. Sequence 
similarity is calculated based on a reference sequence, which may be a subset of 
a larger sequence, such as a conserved motif, coding region, flanking region, efc. 

15 A reference sequence will usually be at least about 18 nt long, more usually at 
least about 30 nt long, and may extend to the complete sequence that is being 
compared. Algorithms for sequence analysis are known in the art, such as 
BLAST, described in Altschul et al. (1990), J. Mol. Biol. 215:403-10. In general, 
variants of the invention have a sequence identity greater than at least about 

20 65%, preferably at least about 75%, more preferably at least about 85%, and may 
be greater than at least about 90% or more as determined by the Smith- 
Waterman homology search algorithm as implemented in MPSRCH program 
(Oxford Molecular). Exemplary search parameters for use with the MPSRCH 
program in order to identify sequences of a desired sequence identity are as 

25 follows: gap open penalty: 12; and gap extension penalty: 1 . 

K+Hnov Polypeptides 
The subject nucleic acid sequences may be employed for producing all or 
portions of K+Hnov polypeptides. For expression, an expression cassette may be 
30 employed. The expression vector will provide a transcriptional and translational 
initiation region, which may be inducible or constitutive, where the coding region 
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is operably linked under the transcriptional control of the transcriptional initiation 
region, and a transcriptional and translational termination region. These control 
regions may be native to a K+Hnov gene, or may be derived from exogenous 
sources. 

5 The peptide may be expressed in prokaryotes or eukaryotes in accordance 

with conventional ways, depending upon the purpose for expression. For large 
scale production of the protein, a unicellular organism, such as E. coll B. subtilis, 
S. cerevisiae, insect cells in combination with baculovirus vectors, or cells of a 
higher organism such as vertebrates, particularly mammals, e.g. COS 7 cells, 

10 may be used as the expression host cells. In some situations, it is desirable to 
express the K+Hnov gene in eukaryotic cells, where the K+Hnov protein will 
benefit from native folding and post-translational modifications. Small peptides 
can also be synthesized in the laboratory. Peptides that are subsets of the 
complete K+Hnov sequence may be used to identify and investigate parts of the 

15 protein important for function, or to raise antibodies directed against these 
regions. 

Fragments of interest include the transmembrane and pore domains, the 
signal sequences, regions of interaction between subunits, etc. Such domains 
will usually include at least about 20 amino acids of the provided sequence, more 

20 usually at least about 50 amino acids, and may include 100 amino acids or more, 
up to the complete domain. Binding contacts may be comprised of non- 
contiguous sequences, which are brought into proximity by the tertiary structure of 
the protein. The sequence of such fragments may be modified through 
manipulation of the coding sequence, as described above. Truncations may be 

25 performed at the carboxy or amino terminus of the fragment, e.g. to determine the 
minimum sequence required for biological activity. 

With the availability of the protein or fragments thereof in large amounts, by 
employing an expression host, the protein may be isolated and purified in 
accordance with conventional ways. A lysate may be prepared of the expression 

30 host and the lysate purified using HPLC, exclusion chromatography, gel 
electrophoresis, affinity chromatography, or oth r purification technique. The 
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purified protein will generally be at least about 80% pure, preferably at least about 
90% pure, and may be up to and including 100% pure. Pure is intended to mean 
free of other proteins, as well as cellular debris. 

The expressed K+Hnov polypeptides are useful for the production of 

5 antibodies, where short fragments provide for antibodies specific for the particular 
polypeptide, and larger fragments or the entire protein allow for the production of 
antibodies over the surface of the polypeptide. Antibodies may be raised to the 
wild-type or variant forms of K+Hnov. Antibodies may be raised to isolated 
peptides corresponding to specific domains, e.g. the pore domain and the 

10 transmembrane domain, or to the native protein. 

Antibodies are prepared in accordance with conventional ways, where the 
expressed polypeptide or protein is used as an immunogen, by itself or 
conjugated to known immunogenic carriers, e.g. KLH, pre-S HBsAg, other viral or 
eukaryotic proteins, or the like. Various adjuvants may be employed, with a 

15 series of injections, as appropriate. For monoclonal antibodies, after one or more 
booster injections, the spleen is isolated, the lymphocytes immortalized by cell 
fusion, and then screened for high affinity antibody binding. The immortalized 
cells, i.e. hybridomas, producing the desired antibodies may then be expanded. 
For further description, see Monoclonal Antibodies: A Laboratory Manual . Harlow 

20 and Lane eds., Cold Spring Harbor Laboratories, Cold Spring Harbor, New York, 
1988. If desired, the mRNA encoding the heavy and light chains may be isolated 
and mutagenized by cloning in E. co//, and the heavy and light chains mixed to 
further enhance the affinity of the antibody. Alternatives to in vivo immunization 
as a method of raising antibodies include binding to phage "display" libraries, 

25 usually in conjunction with in vitro affinity maturation. 

K+Hnov Genotyping 
The subject nucleic acid and/or polypeptide compositions may be used to 
genotyping and other analysis for the presence of polymorphisms in the 
30 sequence, or variation in the expression of the subject genes. Genotyping may 
b performed to det rmine whether a particular polymorphisms is associated with 
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a disease state or genetic predisposition to a disease state, particularly diseases 
associated with defects in excitatory properties of cells, e.g. cardiac, muscle, renal 
and neural cells. Disease of interest include rippling muscle disease, and type II 
psuedohypoaldosteronism. 

Clinical disorders associated with K+ channel defects include long-QT 
syndrome; a congenital disorder affecting 1 in 10,000-15.000. Affected 
individuals have a prolonged QT interval in the electrocardiogram due to a 
delayed repolarization of the ventricle. Genetic linkage analyses identified two loci 
for long QT syndrome, LQT1, in 11p15.5 and LQT2, in 7q35-36. Positional 
cloning techniques identified the novel K+ channel KvLQTI on chromosome 11 
while candidate gene analysis identified causative mutations in the HERG K+ 
channel for LQT2. 

The weaver mouse exhibits several abnormal neurological symptoms, 
including severe ataxia, loss of granule cell neurons in the cerebellum and 
15 dopaminergic cells in the substantia nigra, as well as seizures and male infertility. 
A Gi>rotein-coupled K+ channel having a mutation in the conserved pore domain 
has been determined to cause the disease. The pahcreatic-islet B-cell ATP- 
sensitive K+ channel (KATP) is composed of two subunits, the sulfonylurea 
receptor (SUR) and the inward rectifier K+ channel Kir6.2. Mutations in both SUR 
20 and Kir6.2 have been identified in patients with persistent hyperinsulinemic 
hypoglycemia of infancy, which is caused by unregulated secretion of insulin. 

Genotyping may also be performed for pharmacogenetic analysis to 
assess the association between an individual's genotype and that individual's 
ability to react to a therapeutic agent Differences in target sensitivity can lead to 
25 toxicity or therapeutic failure. Relationships between polymorphisms in channel 
expression or specificity can be used to optimize therapeutic dose administration. 

Genetic polymorphisms are identified in the K+Hnov gene (examples are 
listed in table 1), e.g. the repeat variation in the 3' UTR of K40. Nucleic acids 
comprising the polymorphic sequences are used to screen patients for altered 
30 reactivity and adverse side effects in response to drugs that act on K+ channels. 
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K+Hnov genotyping is performed by DNA or RNA sequence and/or 
hybridization analysis of any convenient sample from a patient, e.g. biopsy 
material, blood sample, scrapings from cheek, etc. A nucleic acid sample from an 
individual is analyzed for the presence of polymorphisms in K+Hnov, particularly 

5 those that affect the activity, responsiveness or expression of K+Hnov. Specific 
sequences of interest include any polymorphism that leads to changes in basal 
expression in one or more tissues, to changes in the modulation of K+Hnov 
expression, or alterations in K+Hnov specificity and/or activity. 

The effect of a polymorphism in K+Hnov gene sequence on the response 

10 to a particular agent may be determined by in vitro or in vivo assays. Such 
assays may include monitoring during clinical trials, testing on genetically defined 
cell lines, etc. The response of an individual to the agent can then be predicted 
by determining the K+Hnov genotype with respect to the polymorphism. Where 
there is a differential distribution of a polymorphism by racial background, 

15 guidelines for drug administration can be generally tailored to a particular ethnic 
group. 

Biochemical studies may be performed to determine whether a sequence 
polymorphism in a K+Hnov coding region or control regions is associated with 
disease, for example the association of K+Hnov 9 with idiopathic generalized 

20 epilepsy. Disease associated polymorphisms may include deletion or truncation 
of the gene, mutations that alter expression level, that affect the electrical activity 
of the channel, efc. 

A number of methods are available for analyzing nucleic acids for the 
presence of a specific sequence. Where large amounts of DNA are available, 

25 genomic DNA is used directly. Alternatively, the region of interest is cloned into a 
suitable vector and grown in sufficient quantity for analysis. The nucleic acid may 
be amplified by conventional techniques, such as the polymerase chain reaction 
(PCR), to provide sufficient amounts for analysis. The use of the polymerase 
chain reaction is described in Saiki etal. (1985) Science 239:487, and a review of 

30 current techniques may be found in Sambrook et al. Molecular Cloning: A 
Laboratory Manual, CSH Press 1989, pp.14.2-14.33. Amplification may b used 
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to determine whether a polymorphism is present, by using a primer that is specific 
for the polymorphism. Alternatively, various methods are known in the art that 
utilize oligonucleotide ligation as a means of detecting polymorphisms, for 
examples see Riley ef a/. (1990) NAR, 18:2887-2890; and Delahunty ef a/. 
(1996) Am J. Hum. Genet .58:1239-1246. 

A detectable label may be included in an amplification reaction. Suitable 
labels include fluorochromes, e.g. fluorescein isothiocyanate (FITC). rhodamine, 
Texas Red, phycoerythrin, allophycocyanin. 6-carboxyfluorescein (6-FAM), 
Z.y-dimethoxy^'.S'- dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine 
(ROX), 6-carboxy-2 , .4 , ,7 , ,4.7- hexachlorofluorescein (HEX). 5-carboxyfluorescein 
(5-FAM) or N.N.N'.N'-tetramethyl-e- carboxyrhodamine (TAMRA), radioactive 
labels, e.g. 32P, 35S, 3H; etc. The label may be a two stage system, where the 
amplified DNA is conjugated to biotin, haptens, etc. having a high affinity binding 
partner, e.g. avidin, specific antibodies, etc., where the binding partner is 
conjugated to a detectable label. The label may be conjugated to one or both of 
the primers. Alternatively, the pool of nucleotides used in the amplification is 
labeled, so as to incorporate the label into the amplification product 

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by 
one of a number of methods known in the art. The nucleic acid may be 
sequenced by dideoxy or other methods. Hybridization with the variant sequence 
may also be used to determine its presence, by Southern blots, dot blots, etc. 
The hybridization pattern of a control and variant sequence to an array of 
oligonucleotide probes immobilised on a solid support, as described in U.S. 
5,445,934, or in WO95/35505, may also be used as a means of detecting the 
presence of variant sequences. Single strand conformational polymorphism 
(SSCP) analysis, denaturing gradient gel electrophoresis (DGGE), mismatch 
cleavage detection, and heteroduplex analysis in gel matrices are used to detect 
conformational changes created by DNA sequence variation as alterations in 
electrophoretic mobility. Alternatively, where a polymorphism creates or destroys 
a recognition site for a restriction endonuclease (restriction fragment length 
polymorphism, RFLP). the sample is digested with that endonuclease, and the 
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products size fractionated to determine whether the fragment was digested. 
Fractionation is performed by gel or capillary electrophoresis, particularly 
acrylamide or agarose gels. 

In one embodiment of the invention, an array of oligonucleotides are 

5 provided, where discrete positions on the array are complementary to one or 
more of the provided sequences, e.g. oligonucleotides of at least 12 nt, frequently 
20 nt, or larger, and including the sequence flanking a polymorphic position in a 
iCHnov sequence; coding sequences for different K + Hnov channels, panels of ion 
channels comprising one or more of the provided K* channels; efc. Such an array 

10 may comprise a series of oligonucleotides, each of which can specifically 
hybridize to a different polymorphism. For examples of arrays, see Hacia et a/. 
(1996) Nature Genetics 14:441-447; Lockhart et a/. (1996) Nature Biotechnol. 
14:1675-1680; and De Risi etal. (1996) Nature Genetics 14:457-460. 

Screening for polymorphisms in K+Hnov may be based on the functional or 

15 antigenic characteristics of the protein. Protein truncation assays are useful in 
detecting deletions that may affect the biological activity of the protein. Various 
immunoassays designed to detect polymorphisms in K+Hnov proteins may be 
used in screening. Where many diverse genetic mutations lead to a particular 
disease phenotype, functional protein assays have proven to be effective 

20 screening tools. The activity of the encoded K+Hnov protein as a potassium 
channel may be determined by comparison with the wild-type protein. 

Antibodies specific for a K+Hnov may be used in staining or in 
immunoassays. Samples, as used herein, include biological fluids such as 
semen, blood, cerebrospinal fluid, tears, saliva, lymph, dialysis fluid and the like; 

25 organ or tissue culture derived fluids; and fluids extracted from physiological 
tissues. Also included in the term are derivatives and fractions of such fluids. 
The cells may be dissociated, in the case of solid tissues, or tissue sections may 
be analyzed. Alternatively a lysate of the cells may be prepared. 

Diagnosis may be performed by a number of methods to determine the 

30 absence or presence or altered amounts of normal or abnormal K+Hnov 
polypeptides in patient cells. For exampl , detection may utilize staining of cells 
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or histological sections, performed in accordance with conventional methods. 
The antibodies of interest are added to the cell sample, and incubated for a period 
of time sufficient to allow binding to the epitope, usually at least about 10 minutes. 
The antibody may be labeled with radioisotopes, enzymes, fluoresces, 
5 chemiluminescers, or other labels for direct detection. Alternatively, a second 
stage antibody or reagent is used to amplify the signal. Such reagents are well 
known in the art. For example, the primary antibody may be conjugated to biotin, 
with horseradish peroxidase-conjugated avidin added as a second stage reagent. 
Alternatively, the secondary antibody conjugated to a flourescent compound, e.g. 
10 flourescein, rhodamine, Texas red, efc. Final detection uses a substrate that 
undergoes a color change in the presence of the peroxidase. The absence or 
presence of antibody binding may be determined by various methods, including 
flow cytometry of dissociated cells, microscopy, radiography, scintillation counting, 
etc. 

15 

Modulation of Gene Expression 
The K+Hnov genes, gene fragments, or the encoded protein or protein 
fragments are useful in gene therapy to treat disorders associated with K+Hnov 
defects. Expression vectors may be used to introduce the K+Hnov gene into a 

20 cell. Such vectors generally have convenient restriction sites located near the 
promoter sequence to provide for the insertion of nucleic acid sequences. 
Transcription cassettes may be prepared comprising a transcription initiation 
region, the target gene or fragment thereof, and a transcriptional termination 
region. The transcription cassettes may be introduced into a variety of vectors, 

25 e.g. plasmid; retrovirus, e.g. lentivirus; adenovirus; and the like, where the vectors 
are able to transiently or stably be maintained in the cells, usually for a period of 
at least about one day, more usually for a period of at least about several days to 
several weeks. 

The gene or K+Hnov protein may be introduced into tissues or host cells 
30 by any number of routes, including viral infection, microinjection, or fusion of 
v sides. Jet injection may also b used for intramuscular administration, as 
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described by Furth etal. (1992) Anal Biochem 205:365-368. The DNA may be 
coated onto gold microparticies, and delivered intradermally by a particle 
bombardment device, or "gene gun" as described in the literature (see, for 
example, Tang etaL (1992) Nature 356:152-154), where gold microprojectiles are 
coated with the K+Hnov or DNA, then bombarded into skin cells. 

Antisense molecules can be used to down-regulate expression of K+Hnov 
in cells. The anti-sense reagent may be antisense oligonucleotides (ODN), 
particularly synthetic ODN having chemical modifications from native nucleic 
acids, or nucleic acid constructs that express such anti-sense molecules as RNA. 
The antisense sequence is complementary to the mRNA of the targeted gene, 
and inhibits expression of the targeted gene products. Antisense molecules 
inhibit gene expression through various mechanisms, e.g. by reducing the amount 
of mRNA available for translation, through activation of RNAse H, or steric 
hindrance. One or a combination of antisense molecules may be administered, 
where a combination may comprise multiple different sequences. 

Antisense molecules may be produced by expression of all or a part of the 
target gene sequence in an appropriate vector, where the transcriptional initiation 
is oriented such that an antisense strand is produced as an RNA molecule. 
Alternatively, the antisense molecule is a synthetic oligonucleotide. Antisense 
oligonucleotides will generally be at least about 7, usually at least about 12, more 
usually at least about 20 nucleotides in length, and not more than about 500, 
usually not more than about 50, more usually not more than about 35 nucleotides 
in length, where the length is governed by efficiency of inhibition, specificity, 
including absence of cross-reactivity, and the like. It has been found that short 
oligonucleotides, of from 7 to 8 bases in length, can be strong and selective 
inhibitors of gene expression (see Wagner et a/. (1996) Nature Biotechnology 
14:840-844). 

A specific region or regions of the endogenous sense strand mRNA 
sequence is chosen to be complemented by the antisense sequence. Selection 
of a specific sequence for the oligonucleotide may us an empirical method, 
where several candidate sequences are assayed for inhibition of expression of 
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the target gene in an in vitro or animal model. A combination of sequences may 
also be used, where several regions of the mRNA sequence are selected for 
antisense complementation. 

Antisense oligonucleotides may be chemically synthesized by methods 

5 known in the art (see Wagner et a/. (1993) supra, and Milligan et a/., supra.) 
Preferred oligonucleotides are chemically modified from the native 
phosphodiester structure, in order to increase their intracellular stability and 
binding affinity. A number of such modifications have been described in the 
literature, which alter the chemistry of the backbone, sugars or heterocyclic 

10 bases. 

Among useful changes in the backbone chemistry are phosphorothioates; 
phosphorodithioates, where both of the non-bridging oxygens are substituted with 
sulfur, phosphoroamidites; alkyl phosphotriesters and boranophosphates. Achiral 
phosphate derivatives include S'-O'-S'-S-phosphorothioate, 3'-S-5'-0- 

15 phosphorothioate, S'-C^-S'-O-phosphonate and S'-NH-S'-O-phosphoroamidate. 
Peptide nucleic acids replace the entire ribose phosphodiester backbone with a 
peptide linkage. Sugar modifications are also used to enhance stability and 
affinity. The a-anomer of deoxyribose may be used, where the base is inverted 
with respect to the natural p-anomer. The 2 , -OH of the ribose sugar may be 

20 altered to form 2*-0-methyl or 2'-0-allyl sugars, which provides resistance to 
degradation without comprising affinity. Modification of the heterocyclic bases 
must maintain proper base pairing. Some useful substitutions include 
deoxyuridine for deoxythymidine; 5-methyl-2'-deoxycytidine and 5-bromo-2'- 
deoxycytidine for deoxycytidine. 5- propynyl-2 , -deoxyuridine and 5-propynyl-2'- 

25 deoxycytidine have been shown to increase affinity and biological activity when 
substituted for deoxythymidine and deoxycytidine, respectively. 

As an alternative to anti-sense inhibitors, catalytic nucleic acid compounds, 
e.g. ribozymes, anti-sense conjugates, ete. may be used to inhibit gene 
expression. Ribozymes may be synthesized in vitro and administered to the 

30 patient, or may be ncoded on an expression v ctor, from which the ribozyme is 
synthesized in th targeted cell (for example, see International patent application 
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WO 9523225, and Beigelman et al. (1995) Nucl. Acids Res 23:4434-42). 
Examples of oligonucleotides with catalytic activity are described in WO 9506764. 
Conjugates of anti-sense ODN with a metal complex, e.g. terpyridylCu(ll), capable 
of mediating mRNA hydrolysis are described in Bashkin et al. (1995) Abe! 
5 Biochem Biotechnol 54:43-56. 

Genetically Altered Cell or Animal Models for K+Hnov Function 
The subject nucleic acids can be used to generate transgenic animals or 
site specific gene modifications in cell lines. Transgenic animals may be made 

10 through homologous recombination, where the normal K+Hnov locus is altered. 
Alternatively, a nucleic acid construct is randomly integrated into the genome. 
Vectors for stable integration include plasmids, retroviruses and other animal 
viruses, YACs, and the like. 

The modified cells or animals are useful in the study of K+Hnov function 

15 and regulation. For example, a series of small deletions and/or substitutions may 
be made in the K+Hnov gene to determine the role of different transmembrane 
domains in forming multimeric structures, ion channels, ete. Of interest are the 
use of K+Hnov to construct transgenic animal models for epilepsy and other 
neurological defects, where expression of K+Hnov is specifically reduced or 

20 absent. Specific constructs of interest include anti-sense K+Hnov, which will 
block K+Hnov expression, expression of dominant negative K+Hnov mutations, 
ete. One may also provide for expression of the K+Hnov gene or variants thereof 
in cells or tissues where it is not normally expressed or at abnormal times of 
development. 

25 DNA constructs for homologous recombination will comprise at least a 

portion of the K+Hnov gene with the desired genetic modification, and will include 
regions of homology to the target locus. DNA constructs for random integration 
need not include regions of homology to mediate recombination. Conveniently, 
markers for positive and negative selection are included. Methods for generating 

30 cells having targeted gene modifications through homologous recombination are 
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known in the art. For various techniques for transfecting mammalian cells, see 
Keown etal. (1990) Methods in Enzvmoloqy 185:527-537. 

For embryonic stem (ES) cells, an ES cell line may be employed, or 
embryonic cells may be obtained freshly from a host, e.g. mouse, rat, guinea pig, 
5 etc. Such cells are grown on an appropriate fibroblast-feeder layer or grown in 
the presence of leukemia inhibiting factor (LIF). When ES or embryonic cells 
have been transformed, they may be used to produce transgenic animals. After 
transformation, the cells are plated onto a feeder layer in an appropriate medium. 
Cells containing the construct may be detected by employing a selective medium. 

10 After sufficient time for colonies to grow, they are picked and analyzed for the 
occurrence of homologous recombination or integration of the construct. Those 
colonies that are positive may then be used for embryo manipulation and 
blastocyst injection. Blastocysts are obtained from 4 to 6 week old superovulated 
females. The ES cells are trypsinized, and the modified cells are injected into the 

15 blastocoel of the blastocyst. After injection, the blastocysts are returned to each 
uterine horn of pseudopregnant females. Females are then allowed to go to term 
and the resulting offspring screened for the construct. By providing for a different 
phenotype of the blastocyst and the genetically modified cells, chimeric progeny 
can be readily detected. 

20 The chimeric animals are screened for the presence of the modified gene 

and males and females having the modification are mated to produce 
homozygous progeny. If the gene alterations cause lethality at some point in 
development, tissues or organs can be maintained as allogeneic or congenic 
grafts or transplants, or in in vitro culture. The transgenic animals may be any 
' 25 non-human mammal, such as laboratory animals, domestic animals, etc. The 
transgenic animals may be used in functional studies, drug screening, e/c, e.g. to 
determine the effect of a candidate drug on Ras or related gene activation, 
oncogenesis, etc. 
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Testing of K+Hnov Function and Responses 
Potassium channels such as K+Hnov polypeptides are involved in multiple 
biologically important processes. Pharmacological agents designed to affect only 
specific channel subtypes are of particular interest. Presently available 
5 compounds tend to be non-specific and elicit both positive and negative 
responses, thereby reducing clinical efficacy. 

The subject polypeptides may be used in in vitro and in vivo models to test 
the specificity of novel compounds, and of analogs and derivatives of compounds 
known to act on potassium channels. Numerous pharmacological agents have 

10 profound affects on K+ channel activity. As examples, Sotalol (BETAPACE) is a 
class III antiarrhythmic drug that prolongs cardiac action potentials by inhibiting 
delayed rectifier K+ channels. Sulfonylurea drugs, such as Glipizide 
(GLUCOTROL) and Tolazamide (TOLAMIDE) function as antidiabetic drugs by 
blocking ATP-sensitive K+ channels present in pancreatic islet cells, thereby 

15 regulating insulin secretion. Diazoxide (HYPERSTAT IV) is an antihypertensive 
drug that activates ATP-sensitive K+ channels, resulting in the relaxation of 
vascular smooth muscle. There are several other examples of drugs that have 
antidiabetic, antihypertensive, or antiarrhythmic activities. A number of drugs that 
activate K+ channels that have been proposed as coronary vasodilators for the 

20 treatment of both vasospastic and chronic stable angina. 

The availability of multiple K+ channel subunits allows in vitro reconstruction of 
functional channels, which may comprise different alpha and beta subunits. The 
individual components may be modified by sequence deletion, substitution, efc. to 
determine the functional role of specific domains. 

25 Drug screening may be performed using an in vitro model, a genetically 

altered cell or animal, or purified K+Hnov protein, either as monomers, 
homomultimers or hetermultimers. One can identify ligands or substrates that 
bind to, modulate or mimic the action of K+Hnov. Drug screening identifies 
agents that provide a replacement for K+Hnov function in abnormal cells. Of 

30 particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including 
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monitoring cellular excitation and conductance, labeled in vitro protein-protein 
binding assays, electrophoretic mobility shift assays, immunoassays for protein 
binding, and the like. The purified protein may also be used for determination of 
three-dimensional crystal structure, which can be used for modeling 
5 intermodular interactions. 

The term "agent" as used herein describes any molecule, e.g. protein or 
pharmaceutical, with the capability of altering or mimicking the physiological 
function of K+Hnov polypeptide. Generally a plurality of assay mixtures are run in 
parallel with different agent concentrations to obtain a differential response to the 

10 various concentrations. Typically, one of these concentrations serves as a 
negative control, i.e. at zero concentration or below the level of detection. 

Candidate agents encompass numerous chemical classes, though typically 
they are organic molecules, preferably small organic compounds having a 
molecular weight of more than 50 and less than about 2,500 daltons. Candidate 

15 agents comprise functional groups necessary for structural interaction with 
proteins, particularly hydrogen bonding, and typically include at least an amine, 
carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional 
chemical groups. The candidate agents often comprise cyclical carbon or 
heterocyclic structures and/or aromatic or polyaromatic structures substituted with 

20 one or more of the above functional groups. Candidate agents are also found 
among biomolecules including peptides, saccharides, fatty acids, steroids, 
purines, pyrimidines, derivatives, structural analogs or combinations thereof. 

Candidate agents are obtained from a wide variety of sources including 
libraries of synthetic or natural compounds. For example, numerous means are 

25 available for random and directed synthesis of a wide variety of organic 
compounds and biomolecules, including expression of randomized 
oligonucleotides and oligopeptides. Alternatively, libraries of natural compounds 
in the form of bacterial, fungal, plant and animal extracts are available or readily 
produced. Additionally, natural or synthetically produced libraries and compounds 

30 are readily modified through conventional chemical, physical and biochemical 
m ans, and may be used to produce combinatorial librari s. Known 
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pharmacological agents may be subjected to directed or random chemical 
modifications, such as acylation, alkylation, esterification, amidification, efc. to 
produce structural analogs. 

Where the screening assay is a binding assay, one or more of the 

5 molecules may be joined to a label, where the label can directly or indirectly 
provide a detectable signal. Various labels include radioisotopes, fluoresces, 
chemiluminescers, enzymes, specific binding molecules, particles, e.g. magnetic 
particles, and the like. Specific- binding molecules include pairs, such as biotin 
and streptavidin, digoxin and antidigoxin etc. For the specific binding members, 

10 the complementary member would normally be labeled with a molecule that 
provides for detection, in accordance with known procedures. 

A variety of other reagents may be included in the screening assay. These 
include reagents like salts, neutral proteins, e.g. albumin, detergents, etc that are 
used to facilitate optimal protein-protein binding and/or reduce non-specific or 

15 background interactions. Reagents that improve the efficiency of the assay, such 
as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc. may be 
used. The mixture of components are added in any order that provides for the 
requisite binding. Incubations are performed at any suitable temperature, 
typically between 4 and 40°C. Incubation periods are selected for optimum 

20 activity, but may also be optimized to facilitate rapid high-throughput screening. 
Typically between 0.1 and 1 hours will be sufficient. 

The compounds having the desired pharmacological activity may be 
administered in a physiologically acceptable carrier to a host in a variety of ways, 
orally, topically, parenterally e.g. subcutaneously, intraperitoneally, by viral 

25 infection, intravascularly, etc. Depending upon the manner of introduction, the 
compounds may be formulated in a variety of ways. The concentration of 
therapeutically active compound in the formulation may vary from about 
0.1-100 wt.%. The pharmaceutical compositions can be prepared in various 
forms, such as granules, tablets, pills, suppositories, capsules, suspensions, 

30 salves, lotions and the lik . Pharmaceutical grade organic or inorganic carriers 
and/or diluents suitable for oral and topical use can be used to make up 
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compositions containing the therapeutically-active compounds. Diluents known to 
the art include aqueous media, vegetable and animal oils and fats. Stabilizing 
agents, wetting and emulsifying agents, salts for varying the osmotic pressure or 
buffers for securing an adequate pH value, and skin penetration enhancers can 
5 be used as auxiliary agents. 

It is to be understood that this invention is not limited to the particular 
methodology, protocols, cell lines, animal species or genera, and reagents 
described, as such may vary. It is also to be understood that the terminology 

10 used herein is for the purpose of describing particular embodiments only, and is 
not intended to limit the scope of the present invention which will be limited only 
by the appended claims. 

As used herein the singular forms "a", "and", and "the" include plural 
referents unless the context clearly dictates otherwise. Thus, for example, 

15 reference to "a cell" includes a plurality of such cells and reference to "the cell" 
includes reference to one or more cells and equivalents thereof known to those 
skilled in the art, and so forth. All technical and scientific terms used herein have 
the same meaning as commonly understood to one of ordinary skill in the art to 
which this invention belongs unless clearly indicated otherwise. 

20 It must be noted that as used herein and in the appended claims, the 

singular forms "a", "and", and "the" include plural referents unless the context 
clearly dictates otherwise. Thus, for example, reference to "a complex" includes a 
plurality of such complexes and reference to "the formulation" includes reference 
to one or more formulations and equivalents thereof known to those skilled in the 

25 art, and so forth. 

All publications mentioned herein are incorporated herein by reference for 
the purpose of describing and disclosing, for example, the methods and 
methodologies that are described in the publications which might be used in 
connection with the presently described invention. The publications discussed 

30 above and throughout the text are provided solely for their disclosure prior to the 
filing date of the present application. Nothing herein is to b construed as an 
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admission that the inventors are not entitled to antedate such disclosure by virtue 
of prior invention. 

Experimental 

5 The following examples are put forth so as to provide those of ordinary skill 

in the art with a complete disclosure and description of how to make and use the 
subject invention, and are not intended to limit the scope of what is regarded as 
the invention. Efforts have been made to ensure accuracy with respect to the 
numbers used (e.g. amounts, temperature, concentrations, ete.) but some 
10 experimental errors and deviations should be allowed for. Unless otherwise 
indicated, parts are parts by weight, molecular weight is average molecular 
weight, temperature is in degrees centigrade; and pressure is at or near 
atmospheric. 

15 Methods 

Two different types of sequence searches were performed. The first 
centered on the most highly conserved region of the K+ channel family, the pore 
domain. The pore is composed of 15-17 amino acids and can be divided into 
subfamilies based on the number of transmembrane segments present in the 
20 channel. Eleven variant peptide sequences corresponding to the pore domain 
were used in TBLASTN searches against the EST division of Genbank. 
Significant matches were identified, and classified into 2 categories: identical to 
known human K+ channels and related to known K+ channels. The pore 
sequences are shown in Table 2. 
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The unique pore peptides sequences are shown in Table 3. 



TABLE 3 



SEC ID NO 


Ammo acid sequence 


fifl 


vvvvnvvgivii l VO i OL/IV! 


09 


\AAA/A\A/TMTT1 nV(Zr\KA 
VVVVMVV 1 (VI 1 1 LUToUIVl 


70 


WWGWTVTTIGYGDK 

yvvvwv v i v i i i\j i \jur\ 


71 


WWAWTMTTVGYGDM 
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FLFSIEVQVTIGFGG 


73 


FLFSLESQTTIGYGV 


74 


FLFSIETETTIGYGY 


75 


FLFSIETQTTIGYGF 


76 


FLFSVETQTTIGYGF 


77 


FLFSLESQTTIGYGF 


78 


FLFSIETETTIGYGF 






79 


ALYFTFSSLTSVGFGN 



5 The second set of experiments was based on a complex, reiterative process. 

Annotated protein and DNA sequences were obtained from GenBank for all known K+ 
channels from all species. The TBLASTN and BLASTN programs were used to identify 
homologous ESTs, which were then analyzed using the BLASTX and BLASTN 
algorithms to identify ESTs which were related to K+ channels yet not identical to any 
10 known human K+ channel gene. 

Novel human K+ channels were defined as those that had clear homology to 
known K+ channels from any species and were not present as identities or near 
identities to any human-derived sequences in any division of Genbank. 

15 Isolation of full length cDNA sequence. EST clones were picked from the 

IMAGE consortium cDNA library and end-sequenced with vector primers. Gap closure 
was achieved either by primer walking or transposon sequencing. GeneTrapper (Life 
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Technologies) was used to isolate larger cONA clones according to the provided 
protocol. RACE was used to extend the sequences as necessary using standard 
protocols. 

Sequences were assembled in Sequencher (Gene Codes). The presence of 
5 open reading frames was assessed as well as potential start codons. Potential 
polymorphisms were detected as sequence variants between multiple independent 
clones. Sequence homologies were detected using the BLAST algorithms. 

The completed gene sequences and predicted amino acid sequences are 
provided as SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21-24, 26 and 28-29. 
10 Polymorphisms, chromosome locations and family assignments are shown in Table 1 . 

ESTs that had top human hits with >95% identity over 100 amino acids were 
discarded. This was based upon the inventors' experience that these sequences were 
usually identical to the starting probe sequences, with the differences due to sequence 

15 error. The remaining BLASTN and BLASTX outputs for each EST were examined 
manually, i.e., ESTs were removed from the analysis if the inventors determined that 
tfie variation from the known related probe sequence was a result of poor database 
sequence. Poor database sequence was usually identified as a number of W 
nucleotides in the database sequence for a BLASTN search and as a base deletion or 

20 insertion in the database sequence, resulting in a peptide frameshift, for a BLASTX 
output ESTs for which the highest scoring match was to non-related sequences were 
also discarded at this stage. The EST sequences that correspond to each clone are 
shown in Table 4. 

Table 4 



Genbank 
Accession# 
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done ID 
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R35S26 
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Example 2: Chromosomal Localization 

Two primers were designed in the 3'Hintranslated regions of each gene 
sequence to amplify a product across the Stanford G3 radiation hybrid map. or the 
Whitehead GB4 panel. The PCR data were submitted for automatic two-point analysis. 
Mapping data were correlated with cytoband information and comparisons with the 
OMIM human gene map data base were made. The following primers were made: 



10 



fflMClSSF: W TATCCACATCAATGGACAAAGC 3' 
(SEQ ID NO:32) R: 5' TGCATAACTGGCTGGGTGTA 3 
Results: 1.71 cR from D2S331 , Cytogenetic location of 2q37 



Results: 

K+Hnov2onG3 
15 F: 5" GTCAGGTGACCGAGTTCA 3' 
fcSGCTCCATCTCCAGATTCTTCIT 

Results: 0.0 cR from SHGC-1320, Cytogenetic location of 11q12 

20 (stc^Na^F: 5' TGACATCACTGG^GMCTTGA 3' 
SEQ ID N0:34) Ft 5' TGCCTGCAAAGTTTGAACAT3 
Results: 5.23 cR from WI-5509. Cytogenehc location of 2p23 

25 SSdiSSf: 5' TGACATCACTGGATGAACTTGA 3* 
(SEQ ID NO:36) R: 5' TGCCTGCAAAGTTTGAACAT 3 
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Results: 1 .21 cR from AFM20OVC7, Cytogenetic location of 8q23 
K+Hnov1 1 on GB4 

(SEQ ID NO:37) F: 5' ACCTGGTGGTATGGAAGCAT 3* 
5 (SEQ ID NO:38) R: 5' TTTCTCCTGGCCTCTACCC 3' 

Results: 2.43 cR from WI-6756, Cytogenetic location of 8q23 

K+Hnov12 on G3 

(SEQ ID NO:39) F: 5' TCCCTCTTGGGTGACCTTC 3* 
10 (SEQ ID NO:40) R: S ATCTTTGTCAGCCACCAGCT 3' 

Results: 7.45 cR from SHGC-32925, Cytogenetic location of Xp21 

K+Hnov14on GB4 

(SEQ ID NO:41) F: 5' AGGTGTGCTGCCATCTGCTGTTCG3' 
1 5 (SEQ ID NO:42) R: 5' AGCCTATCCTCTCTGAGAGTCAGG 

Results: 7.69 cR from WI-71 07, Cytogenetic location of 1 2q1 4 

K+Hnov28 on GB4 

(SEQ ID NO:43) F: 5' AAGCAGAGTACTCATGATGCC 3' 
20 (SEQ ID NO:44) R: 5* TCTGGTAGACAGTACAGTGG 3' 

Results: 35.38 cR from WI-9695, Cytogenetic location of 3q29 

K+Hnov42 on G3 

(SEQ ID NO:45) F: 5 CATTTGGCTGGTCCAAGATG 3' 
25 (SEQ ID NO:46) R: 5* AGTCATTGGTAGGGAGGTAC 3' 

Results: 7.45 cR from SHGC-32925, Cytogenetic location of Xp21 

K+Hnov44 on G3 

(SEQ ID NO:47) F: S CATGCTTCTACAGTCCAGCC 3* 
30 (SEQ ID NO:48) R: 5* GGTCCTCAGTTGCAGAAATC 3' 

Results: 7.45 cR from SHGC-32925, Cytogenetic location of Xp21 

Map positions for K+Hnov15 and K+Hnov27 were obtained from public databases. 
K+Hnov2 and K+Hnov4 have not been mapped. 



Example 3: Expression Analysis 

RT-PCR was utilized to characterize the expression pattern of the novel ion 
channels. This approach used RNA from 30 different tissues to generate first strand 
cDNA. Total RNA was purchased (Clontech, Invitrogen) and used to synthesize first 
40 strand cDNA using M-MLV reverse transcriptase and the supplied buffer (Gibco-BRL). 
The 20 pi reaction contained 5 \xg total RNA, 100 ng of random primers, 10 mM DTT 
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0.5 mM each dNTP, and an RNAse inhibitor (Gibco-BRL). Identical reactions were set 
up without reverse transcriptase to control for ONA contamination in the RNA samples. 
The synthesis reaction proceeded for 1 hour at 37°C followed by 10 minutes at 95°C. 
These cDNAs, along with control cONA synthesis reactions without reverse 
5 transcriptase, were diluted 1 :5 and 2 m> of each sample were arrayed into 96-well trays, 
dried, and resuspended in PCR buffer prior to PCR amplification. The cDNAs were 
tested with primers with defined expression patterns to verify the presence of 
amplifiable cDNA from each tissue. Gene-specific primers were used to amplify the 
cDNAs in 20 pi PCR reactions with standard conditions, 25 mM MgCI 2 , Taq Gold, and 

10 an appropriate annealing temperature. 

This approach provides for relatively high-throughput analysis of gene 
expression in a large set of tissues in a cost-efficient manner and provides qualitative 
analysis of gene expression only. Modifications can be employed, such as the use of 
internal control primers, limited cycling parameters, and dilution series to convert this 

15 to a quantitative experiment 
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K+Hnov49 on Whitehead GB4 RH mapping panel: 

Primer 1 (SEQ ID NO:5): 5' - CATAGCCATAGGTGAGGACT - 3' 
Primer 2: (SEQ ID N:6) 5' - GAGAGGAAAACAGTCTGGGC - 3' 
5 Results: Cytogenetic location 1 q41 , 4.6cR from framework marker D1 S21 7 

K+Hnov59 on Whitehead GB4 RH mapping panel 

Primer 1 (SEQ ID NO:7): 5' - GGACATCGAACTAAGACCTG - 3' 
Primer 2 (SEQ ID NO:8): 5' - TCCCATGCCATTCAGATCTG - 3' 
10 Results: Cytogenetic location 19q1 3.2, 8.34cr from framework marker D19S425 

Expression Analysis of K+HNOV49 

A probe was created from a fragment corresponding to nucleotides 50 to 
1284 of SEQ ID NO:83 (K+Hnov49) and purified DNA fragment was labeled with 
15 pPJdCTP (Amersham) by the random primer method. Adult human Multiple 
Tissue Northern (MTM™) Blots (Clontech) were hybridized with the pPJ-labeled 
fragment in ExpressHyb™ solution (Clontech) for four hours, washed to a final 
stringency of O.lxSSC, 0.1% SDS at 65 °C and subjected to autoradiography for 
24 hours. 

20 Analysis revealed that K+Hnov49 is expressed as an approximately 4.2kb 

mRNA. Expression levels of K+Hnov49 are high in brain and liver and low in 
kidney tissues. No mRNA was detectable on these Northern blots for heart, 
skeletal muscle, colon, thymus, spleen, small intestine, placenta, lung or 
peripheral blood leukocytes indicating either a very low level of expression or that 

25 it is not expressed in these tissues. Expression analysis was also carried out by 
RT-PCR across an extended series of tissues. The results of these analyses are 
shown in Table 4. Primer pairs used for amplification of K+Hnov49 and 59 are 
the same as those used for RH mapping as indicated above. 
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Table 4 
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What is Claimed is: 

1 An isolated nucleic acid encoding a mammalian K+Hnov protein. 

2. An isolated nucleic acid according to Claim 1, wherein said K+Hnov 
5 protein has the amino acid sequence of SEQ ID NO:2, 4, 6. 8, 10, 12, 14, 16, 18, 

20,25,27, 30, 81 or 83. 

3. An isolated nucleic acid according to Claim 1, wherein said K+Hnov 
protein has an amino acid sequence that is substantially identical to the amino 

10 acid sequence of SEQ ID NO:2. 4, 6, 8, 10, 12, 14, 16, 18. 20, 25. 27, 30. 81 or 
83. 

4. An isolated nucleic acid according to Claim 1 wherein the nucleotide 
sequence of said nucleic acid is SEQ ID NO:1. 3. 5, 7. 9, 11. 13. 15, 17. 19, 21. 

15 22. 23. 24. 26, 28, 29, 80 or 82. 

5. An isolated nucleic acid that hybridizes under stringent conditions to 
a nucleic acid sequence of claim 4. 

20 6. An expression cassette comprising a transcriptional initiation region 

functional in an expression host, a nucleic acid having a sequence of the isolated 
nucleic acid according to Claim 1 under the transcriptional regulation of said 
transcriptional initiation region, and a transcriptional termination region functional 
in said expression host. 



25 



7. A cell comprising an expression cassette according to Claim 6 as 
part of an extrachromosomal element or integrated into the genome of a host cell 
as a result of introduction of said expression cassette into said host cell, and the 
cellular progeny of said host cell. 
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8. A method for producing mammalian K+Hnov protein, said method 
comprising: 

growing a cell according to Claim 7, whereby said mammalian K+Hnov 
protein is expressed; and 
5 isolating said K+Hnov protein free of other proteins. 

g. A purified polypeptide composition comprising at least 50 weight % 
of the protein present as a K+Hnov protein or a fragment thereof. 



10 



1 0. A monoclonal antibody binding specifically to a K+Hnov protein. 

11. A non-human transgenic animal model for K+Hnov gene function 
wherein said transgenic animal comprises an introduced alteration in a K+Hnov 
gene. 

15 

12. The animal model of claim 1 1 , wherein said animal is heterozygous 
for said introduced alteration. 

13. The animal model of claim 12, wherein said animal is homozygous 
20 for said introduced alteration. 

14. The animal model of claim 12. wherein said introduced alteration is 
a knockout of endogenous K+Hnov gene expression. 
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SEQUENCE LISTING 



<110> Miller, Andrew 
Curran, Mark 
Buckler, Alan 

<120> Novel Human Potassium Chann* 



<130> SEQ-15PCT 

<150> 60/076,687 
<151> 1998-02-25 

<150> 60/095,836 
<151> 1998-08-07 

<150> 60/116,448 
<151> 1999-01-19 

<160> 87 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 2932 

<212> DNA 

<213> H. sapiens 

<220> 
<221> CDS 

<222> (103) ... (1180) 
<223> K+Hnovl 
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S sfr ser Isp llu <*. Pro Phe Phe He Phe Pro Leu Thr Tyr Tyr 

235 240 



230 



= = S = E = 2 2 2 2 = 2 = = 2 E 

245 2 50 255 

= S S = 2 2 2 S S 2 S = S = = S 

265 270 



280 285 



S 2 2 2 = 2 2 E = 2 2 = S S ='S 

295 3°° 30 



S E 2 2 S 3 2 S 2 S 2 3 2 S 2 E 

310 315 
ttt cca act cct ctg gtt tct aaa age cca aac agg act gac ctg gat 



402 



450 



498 



546 



594 



642 



690 



738 



786 



834 



882 



930 



978 



1026 



1074 



1122 
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Phe Pro Thr Pro Leu Val Ser Lys Ser Pro Asn Arg Tor Asp Leu Asp 
325 330 " 

SSSS5SSS52S2SSSS 

345 350 
gga ctg aca g aataagactt atccattttt taatgtatta aatacaccca 
Gly Leu Thr 



1170 



1220 



gccagttatg 
tatagctctc 
atacacattc 
gtaactcaac 
tatgatacta 
aaatcaacca 
ctagctccct 
gatttttttc 
agtattacct 
tacagattta 
gttcacttat 
gtggtatttt 
tcaaacggtg 
gtatgattca 
tatttaagtc 
aaatctacag 
cctgtttcca 
aggaggttgg 
atgtttcctc 
ttaagatgat 
caaaacaaaa 
tctgaaattt 
gtgatgacct 
gtgtgaagag 
gaaaataaag 
cacagaaact 
gtagaaatca 
agaattctta 
attgaagact 



cagctacttt 
tacatcacgg 
tggaaacata 
ttgacagaca 
gtaatgaagg 
ttaatctctc 
gggtggaatg 

aagacaacaa 
ccttaatttt 
tacataaacc 
ttgtggcaga 
gcattgcaag 
attcatctaa 
tttgatcata 
tttatagata 
tagataaagc 
acttcaaata 
actagatgaa 
tgttttttat 
catttgcatg 
accaaaaaag 
cccttattca 
aggcaggaat 
cagggctcag 
aagaactctg 
taattattgt 
gaccaacaga 
tttctataat 
taaaaaaaaa 



ttctttactg 
taatcatgcc 
acactctaca 
cttatacaga 
caaaatggac 
attttcatct 
acgacttcac 
agatcattca 
taacaactaa 
taaaagcatt 
aggtgatcag 
gagacgcaaa 
atgacttcta 
taaaacatca 
taaagcttta 
aaaagataat 
gttttttcta 
ctctattatt 
tttatctatg 
attttcattt 
tcaccctatc 
ggtttctgtg 
catctcttga 
cagcaagtca 
gagtggtcta 
gaacttttgc 
ttttcccaac 
aaattaacaa 
aaaaaaaaaa 



tatctcatgt 
tatgcctaca 
ttacaaagtt 
aatgttgctg 
agtgaagttt 
gcaaattgaa 
tatacttagt 
tttggttctt 
gaacaaaaat 
tgaacatgac 
ataagctcca 
attttatttt 
gcaacctaag 
tgatggctct 
cttagatata 
taggcaacaa 
taaacacaaa 
tctttctaaa 
ctaaaatgag 
caataaaaaa 
aggtttcaaa 
ggaaaaatga 
aataaatact 
catttttcta 
agactgataa 
tgtttgaaaa 
ccaagactat 
gattcaccta 



tttctttttt 
taagaatggc 
tgttacctgc 
gtgaatttat 
aacacaactg 
gcaacagttt 
gaatatcctt 
tatactatga 
taacgagaaa 
acccgaacac 
gcccaaatgg 
aaaactgtcc 
taaaaacatt 
aattcataaa 
acttgagtga 
agcattttca 
atcagtgttt 
tctaatagtc 
ccctttccct 
aagagaaact 
cagatttgtg 
aagattaacc 
agctgagtaa 
ctatttgacc 
tagcagaaga 
tcttagacat 
tgtaacacat 
acctttgaaa 



caatgctaat 
tgagctaaca 
tgaaatcaat 
aagaatgtgg 
aactctaaga 
agtttcaaac 
taagagctgg 
aacttgagta 
aacaacaaag 
atacatatat 
aacctgtggg 
tccataataa 
cccctcctat 
tacaaaaata 
gtagggaaaa 
aactcaaatt 
attcaccagt 
tataaaaatt 
tatgtccagt 
gtccttaaaa 
gctgttcttt 
ttccccactg 
aggcaagcag 
aaaaggaaaa 
atatcaagaa 
tcattcttaa 
aaagacagca 
ataaagtagt 



aa 



1280 

1340 

1400 

1460 

1520 

1580 

1640 

1700 

1760 

1820 

1880 

1940 

2000 

2060 

2120 

2180 

2240 

2300 

2360 

2420 

2480 

2540 

2600 

2660 

2720 

2780 

2840 

2900 

2932 



<210> 2 
<211> 359 
<212> PRT 
<213> H. sapiens 



<400> 
Met Asp Ser 
1 

Tyr Arg Arg 

Gly Ala Gin 
35 

Met Asp Met 
50 

Val Val His 
65 

Met Asn Gly 



2 

Ser Asn 
5 

Met Val 
20 

Arg Gly 
Arg Trp 
Trp Leu 



Cys Lys Val 

Thr Lys Asp 

Leu Ala Tyr 
40 

Arg Trp Met 
55 

Val Phe Ala 
70 

Glu Leu Asp 



He Ala Pro 
10 

Gly His Ser 
25 

Leu Arg Asp 



Leu Leu 
Thr Leu 



Ala Trp 
45 

Met Leu Val Phe Ser 
60 

Tyr Val 



Asp Leu 
85 

Thr He Cys Val Lys Tyr He Thr 
100 



Val Leu Trp 
75 

His Asp Ala 
90 

Ser Phe Thr 
105 



Pro Pro 
Ala Ala 



Ser Gin Arg 
15 

Gin Met Asp 
30 

Gly He Leu 

Ala Ser Phe 

Leu Ala Glu 
80 

Glu Asn His 
95 

Phe Ser Phe 
110 
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Ser Leu Glu Thr Gin Leu Thr lie Gly Tyr Gly Thr Met Phe Pro Ser 

n1 c 120 125 

Gly Asp Cys Pro Ser Ala lie Ala Leu Leu Ala lie Gin Met Leu Leu 

135 1^0 
Gly Leu Met Leu Glu Ala Phe lie Thr Gly Ala Phe Val Ala Lys lie 

150 lbb 
Ma Arg Pro Lys Asa Arg Ala Phe Ser lie Arg Phe Thr Asp Thr Ala 

Val Val Ala His Met Asp Gly Lys Pro Asn Leu lie Phe Gin Val Ala 

185 



180 

Asn Thr Arg Pro Ser Pro Leu Thr Ser Val Arg Val Ser Ala Val Leu 

iqc 200 
Tyr Gin III Arg Glu Asn Gly Lys Leu Tyr Gin Thr Ser Val Asp Phe 

215 22 
His Leu Asp Gly lie Ser Ser Asp Glu Cys Pro Phe Phe He Phe Pro 

230 4»«J3 
Su Thr Tyr Tyr His Ser lie Thr Pro tar Ser Pro Leu Ala Thr Leu 

L eu Gin His Glu tel Pro Ser His Phe Glu Leu Val Val Phe Leu Ser 

260 265 
Ala Met Gin Glu Gly Thr Gly Gin He Cys Gin Arg Arg Thr Ser Tyr 

"> 7 c 280 , 

Leu Pro Ser Glu lie Met Leu His His Cys Phe Ala Ser Leu Leu Thr 

^ Gly Ser Lys Gly Glu Tyr Gin He Lys Met Glu Asn Phe Asp Lys 

310 313 

Thr Val Pro Glu Phe Pro Thr Pro Leu Val Ser Lys Ser Pro Asn Arg 

325 330 
Thr Asp Leu Asp He His He Asn Gly Gin Ser He Asp Asn Phe Gin 

340 345 
He Ser Glu Thr Gly Leu Thr 
355 

<210> 3 
<211> 1927 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<222> (105)... (1908) 
<223> K+Hnov4 

< 400> 3 . ^^t-^rsr cacaaacqca 60 



w=sss==sssss=r=sss - 

1 

a is 2 a s s s s = a s s = s s s 

5 10 15 

tac cqc age acc ctg cgc acc eta ccg gga ace cgc etc gee tgg 
S £ £g Ser Thr Leu Arg Thr Leu Pro Gly Thr Arg Leu Ala Trp 

etg gee gae eee gae gge ggg ggc egg eee gag aee gat gge ggc ggt 
Zl Ala Asp Pro Asp Gly Gly Gly Arg Pro Glu Thr Asp Gly Gly Gly 



164 



212 



260 



4 
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ata gat age age gge age age ggc ggc ggg ggc tgc gag ttc ttc ttc 
S SJ Ser Ser lly Ser Ser Gly Gly Gly Gly Cys Glu Phe Phe Phe 
55 60 65 

aac aaa cac ccg ggc gtc ttc gee tac gtg etc aac tac tac cgc ace 
Sp Arg His Pro £y Val Phe Ala Tyr Val Leu Asn Tyr Tyr Arg Thr 
70 75 80 

aac aaa ctg cac tgc ccc gca gac gtg tgc ggg ccg etc ttc gag gag 
ggc aag ctg a = Le U phe Glu Glu 

Gly Lys Leu His Cys Pro Ala Asp Val cys feiy fro ^ 

85 9° 95 

r,„ a eta ace ttc tag ggc ate gac gag ace gac gtg gag ccc tgc tgc 
S Leu £a Phe 3 "y He Lp Glu Thr Asp Val Glu Pro Cys Cys 
105 110 

toa atg ace tac egg cag cac cgc gac gee gag gag gcg ctg gac ate 
Trp Met Thr Tyr Axg Gin His Arg Asp Ala Glu Glu Ala Leu Asp lie 
120 125 

ttc gag acc ccc gac etc att ggc ggc gac ccc ggc gac gac gag gac 
Phe Glu Thr Pro Asp Leu lie Gly Gly Asp Pro Gly Asp Asp Glu Asp 
135 ' 140 

eta acg gee aag agg ctg ggc ate gag gac gcg gcg ggg etc ggg ggc 
2u IS Ma Lys Arg Leu Gly He Glu Asp Ala Ala Gly Leu Gly Gly 
150 155 160 

ccc gac ggc aaa tct ggc cgc tgg agg agg ctg cag ccc cgc atg tgg_ 
Pro Sp Lys Ser Gly Arg Trp Arg Arg Leu Gin Pro Arg Met Trp 



165 



170 



acc etc ttc gaa gac ccc tac teg tec aga gee gec agg ttt att get 
£a Leu Phe Glu Lp Pro Tyr Ser Ser Arg Ala Ala Arg Phe lie Ala 
185 I 90 195 

ttt act tet tta ttc ttc ate ctg gtt tea att aca act ttt tgc ctg 
Phe £ Ser Leu Phe Phe lie Leu Val Ser lie Thr Thr Phe Cys Leu 
200 205 

aaa aca cat gaa get ttc aat att gtt aaa aac aag aca gaa cca gtc 
1" As Glu Ala Phe Asn lie Val Lys Asn Lys Thr Glu Pro Val 
215 220 225 

afr aat aac aca agt gtt gtt eta cag tat gaa att gaa acg gat cct 
S Asn % Tnr Ser Val Val Leu Gin Tyr Glu He Glu Thr Asp Pro 
230 «S 240 

gec ttg acg tat gta gaa gga gtg tgt gtg gtg tgg ttt act ttt gaa 
i£a Leu Thr Tyr Val Glu Gly Val Cys Val Val Trp Phe Thr Phe Glu 
245 2 50 255 

ttt tta gtc cgt att gtt ttt tea ccc aat aaa ctt gaa ttc ate aaa 
Phe Leu val Z g lie Val Phe Ser Pro Asn Lys Leu Glu Phe lie Lys 
265 270 



f ^ aat ate att qac ttt gtg gec ate eta cct ttc tac tta 
£ n 2u £u Asn S lie Asp Phe Va? Ala He Leu Pro Phe Tyr Leu 
280 285 290 

gag gtg gga etc agt ggg ctg tea tec aaa get get aaa gat gtg ctt 



308 



356 



404 



452 



500 



548 



596 



644 



692 



740 



788 



836 



884 



932 



980 



1028 



5 
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Glu val Gly Leu Ser Gly Leu Ser Ser Lys Ala Ala Lys Asp Val Leu 
295 300 305 

ggc ttc etc agg gtg gta agg ttt gtg agg ate ctg aga att ttc aag 



III III Leu Arg val Val Arg Phe Val Arg He Leu Arg lie Phe Lys 
310 315 320 

etc acc cgc cat ttt gta ggt ctg agg gtg ctt gga cat act ctt cga 
2u Thr tog His Phe Val Gly Leu Arg Val Leu Gly Hxs Thr Leu Arg 
325 330 335 

act agt act aat gaa ttt ttg ctg ctg ata att ttc ctg get eta gga 
£ Ser tL Asn Ilu Phe Leu Leu Leu lie lie Phe Leu Ala Leu Gly 
345 350 

t-t-c ata ttt get acc atg ate tac tat gec gag aga gtg gga get 
£ Leu X Phe 5. Thr Me? He Tyr Tyr Ala Glu Arg Val Gly Ala 
360 365 

caa cct aac gac cct tea get agt gag cac aca cag ttc aaa aac att 
X Pro Asn Lp Pro Ser Ala Ser Glu His Thr Gin Phe Lys Asn lie 
375 380 385 

ccc att ggg ttc tgg tgg get gta gtg acc atg act acc ctg ggt tat 
Pro lie Gly Phe Trp Trp Ala Val Val Thr Met Thr Thr Leu Gly Tyr 
390 395 «0 



ggg gat atg tac ccc caa aca tgg tea ggc atg ctg gtg gga gee ctg 
l!y Lp Met Tyr Pro Gin Thr Trp Ser Gly Met Leu Val Gly Ala Leu 
405 410 415 

t-trt act eta act gga gtg ctg aca ata gee atg cca gtg cct gtc att 
gs IS Leu Ma lly Val Leu Thr He Ala Met Pro Val Pro Val He 
425 430 

n +n *at aat ttt gga atg tac tac tec ttg gca atg gca aag cag aaa 
E £ £ Phe Hel Tyr Tyr Ser Leu Ala Met Ala Lys Gin Lys 
440 445 450 

ctt cca aoq aaa aga aag aag cac ate cct cct get cct cag gca age 
Leu Pro S £s Lys Lys His lie Pro Pro Ala Pro Gin Ala Ser 
455 460 465 

tea cct act ttt tgc aag aca gaa tta aat atg gee tgc aat agt aca 
Ser Pro Tnr Phe Cys Lys Thr Glu Leu Asn Met Ala Cys Asn Ser Thr 
470 4 75 480 

cag agt gac aca tgt ctg ggc aaa gac aat cga ctt ctg gaa cat aac 
G?n Ser Lp Thr Cys Leu Gly Lys Asp Asn Arg Leu Leu Glu His Asn 
485 490 495 

aaa tea gtg tta tea ggt gac gac agt aca gga agt gag ccg cca eta 
5 Ser Va? Leu Ser G?y Asp Asp Ser Thr Gly Ser Glu Pro Pro Leu 
505 510 515 

tea ccc cca gaa agg etc ccc ate aga cgc tct agt acc aga gac aaa 
Ser Pro Pro Glu Arg Leu Pro lie Arg Arg Ser Ser Thr Arg Asp Lys 
520 525 530 

aac aaa aaa qgg gaa aca tgt ttc eta ctg acg aca ggt gat tac acg 
^SS G?y Glu Thr Cys Phe Leu Leu Thr Thr Gly Asp Tyr Thr 



1076 



1124 



1172 



1220 



1268 



1316 



1364 



1412 



1460 



1508 



1556 



1604 



1652 



1700 



1748 
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535 540 



545 



tat act tct gat gga ggg ate agg aaa gga tat gaa aaa tec cga age 
gs S sll Lp Sy Gly lie Arg Lys Gly Tyr Glu Lys Ser Arg Ser 
550 555 560 

tta aae aae ata geg gge ttg gea gge aat get etg agg etc tct cea 
Leu Asn Asn lie Ala Gly Leu Ala Gly Asn Ala Leu Arg Leu Ser Pro 
565 570 575 

ata aca tea ccc tac aae tct ect tgt cct etg agg cgc tct cga tct 
Val ^hr Ser Pro Tyr Asn Ser Pro Cys Pro Leu Arg Arg Ser Arg Ser 

ccc ate cea tct ate t tgtaaaccaa accctcgtg 
Pro He Pro Ser He 
600 



1796 



1844 



1892 



1927 



<210> 4 
<211> 601 
<212> PRT 
<213> H. sapiens 



Met Ala^ys Gly Glu Ala Ser Glu Lys lie lie He Asn Val Gly Gly 

Tnr Arg His Glu tL Tyr Arg Ser Thr Leu Arg Thr Leu Pro Gly Thr 

Arg Leu Ala Trp Leu Ala Asp Pro Asp Gly Gly Gly Arg Pro Glu Thr 

40 45 
Asp-Gly Gly Gly Val Gly Ser Ser Gly Ser Ser Gly Gly Gly Gly Cys 

cn 55 
Glu Phe Phe Phe Asp Arg His Pro Gly Val Phe Ala Tyr Val Leu Asn 



%r Tyr Arg Thr Gly Lys Leu His Cys Pro Ala Asp Val Cys Gly Pro 

85 90 
^ Phe Glu Glu Glu Leu Ala Phe Trp Gly lie Asp Glu Thr Asp Val 



Glu Pro Cys Cys Trp Met Thr Tyr Arg Gin His Arg Asp Ala Glu Glu 



120 



Ala Leu Asp He Phe Glu Thr Pro Asp Leu He Gly Gly Asp Pro Gly 
Asp Asp Glu Asp Leu Ala Sa Lys Arg Leu Gly lie Glu Asp Ala Ala 
"ly Leu Gly Gly Pro Asp Gly Lys Ser Gly Arg Trp Arg Arg Leu Gin 



Pro Arg Met Trp Ala Leu Phe Glu Asp Pro Tyr Ser Ser Arg Ala Ala 

180 I 85 19 ^ 

Arg Phe He Ala Phe Ala Ser Leu Phe Phe He Leu Val Ser He Thr 

200 205 



Thr Phe Cys Leu Glu Thr His gIu Ala Phe Asn He Val Lys Asn Lys 



Thr III Pro Val He Asn Gly Thr Ser Val Val Leu Gin Tyr Glu He 



230 



210 215 220 

S Thr Asp Pro Ala Leu Thr Tyr Val Glu Gly Val Cys Val Val Trp 

245 250 
Phe Thr Phe Glu Phe Leu Val Arg He Val Phe Ser Pro Asn Lys Leu 

260 265 270 

Glu Phe lie Lys Asn Leu Leu Asn He He Asp Phe Val Ala He Leu 
275 280 285 
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Pro Phe Tyr Leu Glu Val Gly Leu Ser Gly Leu Ser Ser Lys Ala Ala 
Lys Z val Leu Gly Phe Leu Arg Val Val Arg Phe Val Arg lie Leu 



295 300 
Arg Val Val 
510 315 
3 lie Phe Lys Leu Thr Arg His Phe Val Gly Leu Arg Val Leu Gly 

345 

Gly Val Leu lie Phe 

355 



325 3iU i_ 

His Thr Leu Arg Ala Ser Thr Asn Glu Phe Leu Leu Leu lie lie Phe 

L eu Ala Leu Gly Val Leu lie Phe Ala Thr Met He Tyr Tyr Ala Glu 

arg val Gly Ala Gin Pro Asn Asp Pro Ser Ala Ser Glu His Thr Gin 

Phe Lys Asn Xle Pro He Gly P- «P Trp Ala Val Val Thr Met Thr 

£ Leu Gly Tyr Gly Asp Met Tyr Pro Gin Thr Trp Ser Gly Met Leu 



val Gly Ala Leu Cys Ala Leu Ala Gly Val Leu Thr Xle Ala Met Pro 



425 



val Pro val fie Val Asn Asn Phe Gly Met Tyr Tyr S« Leu Ala Met 

,1a L ys Gin Lys Leu Pro Arg Lys Arg Lys Lys His Xle Pro Pro Ala 

Pro Gin Ala ser Ser Pro Tar Phe Cys Lys Thr Glu Leu Asn Met Ala 
470 47b 
Asn ser Thr Gin Ser Asp Thr Cys Leu Gly Lys Asp Asn Arg Leu 

Glu His Asn Arg Ser Val Leu Ser Gly Asp Asp Ser Thr Gly Ser 

Glu Pro Pro Leu Ser Pro Pro Glu Arg Leu Pro Xle Arg Arg Ser Ser 

Thr Arg Asp Lys Asn Arg Arg Gly Glu Thr Cys Phe Leu Leu Thr Thr 

G ly Isp Tyr Thr Cys Ala III Asp Gly Gly Xle Arg Lys Gly Tyr Glu 

£ ser Arg Ser Leu Asn Asn Xle Ala Gly Leu Ala Gly Asn Ala Leu 

arg .eu Ser Pro Val Thr Ser Pro Tyr Asn Ser Pro Cys Pro Leu Arg 
580 585 



Arg Ser Arg Ser Pro lie Pro Ser He 
595 600 

<210> 5 
<2H> 2293 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<222> (330)..- (1800) 
<223> K+Hnov6 

gggaag^aacccagggc ccttgctctc gtgcagcgct g^ccctggg tggggaeggc SO 
Saggcttg cagcgcaggt gagagtgatt ttccaggat tgctttggcc tg^^ ^ 

sss 2S=s =~ Ss rib SS5 - 

ggcagagcgt gtggcatctc cacctcaagg gtgcagcctg atcttretrt tc a 
SgcLgcac tctgccttct gtatccacc atg gtg ttt ggt gag ttt ttc ^ 



1 



8 
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S £ 5 = S S = £ C = S = SI S ST, 2 
5 = = S5 = S = = = SS=S = = = 

30 

= If, SI = a £ 5 S £ E S E S 2 H 2 

45 50 

Eaaassaaaasaaaaa 

75 80 

S C S S H = S S S = S I = = = = 

90 95 

sss=====sss3:s==s 

105 110 

S2ES52ESSB5SSS2;- 

125 130 

S S C S! S = S = = £ = = = = S " S 

140 145 

155 160 

a ft aoa ata qag aat cca gcg tac tgc ctg tec get 
S £ S S S S m2 « - Pro Tyr Cys u. Ser Ala 
170 175 



Lys Leu lie Ala lie war — —~ ~ 2 00 

185 190 

S2SSS = S 2 = S = = = S I = 

205 210 

b s 5 = a = = a a is s s s s a s 

220 225 

£ S 5 S 2 = a S = S E S = 5 = 5 

235 240 
aaa ttc tgg aaa aac cot ctg aac ate att gac ttt gtc tct att att 



401 



449 



497 



545 



593 



641 



689 



737 



785 



833 



881 



929 



977 



1025 



1073 



1121 
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Lvs Phe Trp Lys Asn Pro Leu Asn He He Asp Phe Val Ser He lie 
250 2 55 260 



nrr ttc tat gee acg ttg get gta gac acc aag gag gaa gag agt gag 
III £ Syr Ma S Le« Ala Val Asp Thr Lys Glu Glu Glu Ser G u 
265 270 " 5 

aat att gag aac atg ggc aag gtg gtc cag ate eta egg ctt atg agg 
£ Te 111 Asn ml Gly Lys Val Val Gin He Leu Arg Leu Met Arg 
285 290 

att ttc ega att eta aag ctt gee egg eae teg gta gga ctt egg tet 
He Phe Arg He Leu Lys Leu Ala Arg His Ser Val uiy y 
300 305 

E S E 2 2 S = = 2 S E S 1 2 2 2 

315 320 

i-hr etc tet gtg ggc att tec att ttc tet gtg ctt ate tac tec 
2u Phe Leu Ser Va! G^y He Ser He Phe Ser Val L*u He Tyr Ser 
330 335 3" 



S3 S 222 = 2 2 S 2 S = S = £ 5 
S S S E 2 S 2 2 SS S 5 2222 

365 370 

cac ccg gtc ace ttg gcg gga aag etc ate gee age aca tgc ate ate 
His Pro val Thr Leu Ala Gly Lys Leu lie Ala Ser Thr Cys He He 
380 385 

„nn a tc tto atg gtg gee ctt ccc ate acc ate ate ttc aac aag 
gs Z He Su 2 val Ala Leu Pro He Thr He lie Phe Asn Lys 

hf( . aao tac t ac cag aag caa aag gac att gat gtg gac cag tgc 
Pne Ser LyI Tyr Tyr Gin Lys Gin Lys Asp He Asp Val Asp Gin Cys 
410 415 420 

agt gag gat gca cca gag aag tgt eat gag eta cct tac ttt aac att 
Ser Glu Lp Ala Pro Glu Lys Cys His Glu Leu Pro Tyr Phe Asn 
425 430 435 

agg gat ata tat gca cag egg atg cac gee ttc att acc agt etc tet 
S Lp He Tyr Ala Gin Arg Met His Ala Phe He Thr Ser Leu Ser 
445 450 



ata aac att gtq gtg age gat cct gac tec aca gat get tea age 
Ser Val 3£ S Val S Ser Lp Pro Asp Ser Thr Asp Ala Ser Ser 
460 465 

*t-h oaa aac aat gag gac att tgt aac acc acc tec ttg gag aat tgc 
Se Su Z E ™ Lp He Cys Asn Thr Thr Ser Leu Glu Asn Cys 

480 485 



1169 



1217 



1265 



1313 



1361 



1409 



1457 



1505 



1553 



1601 



1649 



1697 



1745 



1793 



475 

aea gea a aatgageggg ggtgtttgtg eetgtttete ttateettte eeaaeattag 
Thr Ala 

10 



1850 
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490 



gttaacacag ctttataaac ctcagtgggt tcgttaaaat catttaattc tca^ggtgta »» 

cctttcagcc atagttggac attcattgct 3«"ctgaa ^tgatag | 2030 

tttctctgtg aggtcaatta aatgccttgt "tgaaattt ^ttttt 2090 

SSS 3SS K== 35= ~ s=S SS 

S=S S52S S=S °™* '™ £ 

cagaaaaaaa aaaaaaaaaa aaa 



<210> 6 
<211> 490 
<212> PRT 
<213> H. sapiens 

<400> 6 



Met Val^e Gly Glu Phe Phe His Arg Pro Oly Gin Asp Glu Glu Leu 
5 10 



Val Asn Leu Asn Val Gly Gly Phe Lys Glu Ser Val Asp Gin Ser Thr 

U>» Leu Arg Phe Pro His Thr Arg Leu Gly Lys Leu Leu Thr Cys His 

Ser Glu Glu Ala He Leu Glu lL Cys Asp Asp Tyr Ser Val Ala Asp 

Ly3 Glu Tyr Tyr Phe Asp Arg Asn Pro Ser Leu Phe Arg Tyr Val Leu 

£n Phe Tyr Tyr Thr Gly Lys Leu His Val Met Glu Glu Leu Cys Val 

Ph e ser Phe Cys Gin Glu lie Glu Tyr Trp Gly He Asn Glu Leu Phe 

XXe-Asp ser Cys Cys Ser Asn Arg Tyr Gin Glu Arg Lys Glu Glu Asn 

His Glu Lys Asp Trp Asp Gin Lys Ser His Asp Val Ser Thr Asp Ser 

ser Phe Glu Glu Ser Ser £ Phe Glu Lys Glu Leu Glu Lys Phe Asp 

5£ Leu Arg Phe Gly Gin Leu Arg Lys Lys He Trp He Arg Met Glu 

Asn Pro Ala Tyr cys Leu Ser Ala Lys Leu He Ala He Ser Ser Leu 

Ser val Val £u Ala Ser He Val Ala Met Cys Val His Ser Met Ser 

Glu Phe III Asn Glu Asp Gly Glu Val Asp Asp Pro Val Leu Glu Gly 

215 



210 ,i 7v -y ^ t>hf» Thr Glv Glu Leu Ala Val Arg 

Val Glu He Ala Cys He Ala Trp Phe Tnr Giy <*xu ^ 

230 " 3 

Su Ala Ala Ala Pro Cys Gin Lys Lys Phe Trp Lys Asn Pro Leu Asn 

XXe He Asp Phe Val Ser He He Pro Phe Tyr Ala Thr Leu Ala Val 

Th r Lys Glu Glu Glu Ser Glu Asp He Glu Asn Met Gly Lys Val 

val Gin lie Leu Arg Leu Met Arg He Phe Arg lie Leu Lys Leu Ala 

^ His Ser Val Gly Leu Arg Ser Leu Gly Ala Thr Leu Arg His Ser 

£ His Glu val Gly Leu Leu Leu Leu Phe Leu Ser Val Gly He Ser 
325 



He Phe Ser Val Lu He Tyr Ser Val Glu Lys Asp Asp His Thr Ser 
340 345 



11 
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Ser Leu Thr Ser He Pro He Cys Trp Trp Trp Ala Thr lie Ser Met 
Thr Thr Val Gly Tyr Gly Asp £ His Pro Val Thr Leu Ala Gly Lys 
L eu He Ala Ser Thr Cys 2 He Cys Gly He Leu Val Val Ala Leu 

390 

HI lie Thr lie He Phe Asn Lys Phe Ser Lys Tyr Tyr Gin Lys Gin 

405 410 

Lys Asp lie Asp val Asp Gin Cys Ser Glu Asp Ala Pro Glu Lys Cys 
His Glu Leu "ro Tyr Phe Asn He Arg Asp He Tyr Ala Glu Arg Met 
His Ala Phe He Thr Ser Leu Ser Ser Val Gly He Val Val Ser Asp 
Pro Z Ser Thr Asp Ala Ser Ser He Glu Asp Asa Glu Asp He Cys 



470 



Thr Thr Ser Leu Glu Asn Cys Thr Ala 
485 490 



<210> 7 
<211> 3080 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<222> (480)... (1977) 
<223> K+Hnov9 



60 
120 
180 
240 
300 
360 



9 «^c2\Ucctcc tc^ccc-c «o = c «ccCcc«t J-^Jg 
tco.c~.cc »9t«2S SC.9t.tac 

SEES SSSS g~ J« ™ : -s- 

5SS SSSS =£S » y-- - 

S S = S 5 S S 2 S 5 2 « - - - - 
= 2 S 5= 2 S S S S 52 5 = S 5 E S 

20 25 

S S = S S S 5 5 £ = S = S S S ffi 

35 40 

50 55 
ctt ggc aag otg gcc gtg gtg gtg get tec tac cgc cgc ccc ggg gcc 
Su Sy Lys Leu Ala Val Val Val Ala Ser Tyr Arg Arg Pro Gly Ala 
65 70 

«.„ r-r-n M e cct ctq gag ctt tgc gac gat gcc aac ccc 
2 S K S Pro Ser Pro 2 U Leu Cys Asp Asp Ala Asa Pro 



575 



623 



671 



719 



767 



85 90 
12 
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s s £ a £ £ £ s £ = = = s S S £ 

100 105 

115 120 

^rr ttc eta caq gag ate cag tac tgg ggc ate gat gag 
S 2 £ S C e C r £ 2 GXn U He Gin Tyr Trp dy He Asp Gl« 
130 135 

145 150 155 

2 £ £ £ £ £ £ S S K = = = = S S 

165 170 

is s a £ E £ 5 £ £ = s - s = £ s 

180 185 

S = S = S = S = = SS55 = S55 

195 200 

k s s £ s s £ £ s s £ s a a = s 

210 215 

£ = 2 E £ 2 = S S £ £ 5 £ = = = 

225 230 

Z £ E S £ K £ S £ = = 3 = = S = 

245 250 
260 265 

ss==sss=====s £S= 

275 280 

S £ £ S s £ £ £ S £ £ £ £ = = = 

290 295 

a=S55£S2S=S2SSS2 

. 305 310 315 

S2=S25S===S===S S 



815 



863 



911 



959 



1007 



1055 



1103 



1151 



1199 



1247 



1295 



1343 



1391 



1439 



1487 



325 330 
13 
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S = S = =5 5 E = S S = = | = £ 

340 345 

£ S S 5 S = = = = = S C S SS S " 

355 360 

. nstn aca acc ttc aca agt gtc cct tgt gca tgg tgg 
VS. S S S 5 S S L * ser 

370 375 

~„ At-o act act gtg gga tat ggg gac att aga cca 

S S £ E 2 2 £ * '- ! iy Si 01y w " 3 S 

385 390 

5 S = = 5 S S S E S S 5 S = = S 

405 410 

£ 2 S 2 S = = £ = = S = 5 i = S 

420 425 

E S 5 £ = 2 S = S = = 2 S = = S 

435 440 

E K = 5 5 ESSE S S £ S = S S 

450 455 
agt gtt aac ttg aga gat gtc tat gcc egg agt ate atg gag atg ctg 
Ser Val Asn Leu Arg Asp Val Tyr Ala Arg »« 48Q 
465 470 

SSS5SESK2E = = = = Ss S 

485 490 



gat ttc tgg t tttgaattaa ttttcaattt atttacaaaa gctatgtaca 
Asp Phe Trp 



1535 



X583 



1631 



1679 



1727 



1775 



1823 



1871 



1919 



1967 



2017 



attaactaaa atgataaagc agtgatgtgg atttctgtat tctgatgatg agtctcttc* 2077 

gagtactget catcttaatt aatttttget Jtatatgc ttcatctact j, ^ ^ 

acatcaccta taacaactgc acagtgttct 9 a «^"9 „ c Ctcaaa tataaaataa 2257 

aacacaacca aatacaactg ggecaatata aacatgtttg «"gtcaaa 2317 
tgttattgca atacatacaa aaaagttaaa ga^ta tgt atcac^ 

£cat£g S ctatttaaag atattatcca agtacataaa 2437 

ttactccgtt ctctatcagt taaagctatt ^tataata cttagcttta <£**^ 2557 

cccatatttg atgggcagag attatatccc tatcttcttt "catg ct 2617 

caaatgaact gatctctgta tcccattatt a«ataagag ggggaa cattctccca 2677 

tagattgeag tacatgagtc tacacaaaga ottouaK gaatatcaga 2737 
ac?gagtgta gtatgtggag cataaaacag catatttctt agtatttcat ^ 

tggtctttaa atgtctcttt atggatgtat tgttcacatt atggc aataatctta 2857 

S SEE ™ i-s -sss -» 

2SSS SSS 2SK SEE SS— - 3 °" 
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3080 

taattaaagc caaaacctct aaagctaaaa aaaaaaaaaa aaa 

<210> 8 
<211> 499 
<212> PRT 
<213> H. sapiens 



Cys Ala £eu Ser Phe Leu Gin Glu lie Gin Tyr Trp Gly lie Asp Glu 

14 0 



Leu Ser He Asp Ser Cys Cys Arg Asp Arg Tyr Phe Arg Arg Lys Glu 



135 



155 



Met Proper Ser Gly Arg Ala Leu Leu Asp Ser Pro Leu Asp Ser Gly 

5 10 15 

Ser Leu Thr Ser Leu Asp Ser Ser Val Phe Cys Ser Glu Gly Glu Gly 

20 25 
Glu Pro Leu Ala Leu Gly Asp Cys Phe Thr Val Asn Val Gly Gly Ser 

~ 35 40 . » 

Arg Phe Val Leu Ser Gin Gin Ala Leu Ser Cys Phe Pro His Thr Arg 

c ft 55 60 

Leu Gly Lys Leu Ala Val Val Val Ala Ser Tyr Arg Arg Pro Gly Ala 
£c 70 75 

Leu Ala Ala Val Pro Ser Pro Leu Glu Leu Cys Asp Asp Ala Asn Pro 

85 90 
Val Asp Asn Glu Tyr Phe Phe Asp Arg Ser Ser Gin Ala Phe Arg Tyr 

100 i° 5 
val Leu His Tyr Tyr Arg Thr Gly Arg Leu His Val Met Glu Gin Leu 
115 120 125 

Leu 

He 

III Ser Glu Thr Leu Sp Phe Lys Lys Asp Tnr Glu Asp Gin Glu Ser 

165 170 
Gin His Glu Ser Glu Gin Asp Phe Ser Gin Gly Pro Cys Pro Thr Val 

180 XSS 13 

Arg Gin Lys Leu Trp Asn lie Leu Glu Lys Pro Gly Ser Ser Thr Ala 

Val 

lie Asn Met Ala Leu Met Ser Ala Glu Leu Ser Trp Leu Asp Leu Gin 

230 235 * 

Leu Leu Glu lie Leu Glu Tyr Val Cys lie ser Trp Phe Thr Gly Glu 

245 250 _ 

Phe val Leu Arg Phe Leu Cys Val Arg Asp Arg Cys Arg Phe Leu Arg 

260 265 270 

Lys Val Pro Asn lie lie Asp Leu Leu Ala lie Leu Pro Phe Tyr lie 

275 280 285 

Thr Leu Leu Val Glu Ser Leu Ser Gly Ser Gin Thr Thr Gin Glu Leu 

290 295 30° , , 

Glu Asn Val Gly Arg He Val Gin Val Leu Arg Leu Leu Arg Ala Leu 
->05 310 315 

Arg Met Leu Lys Leu Gly Arg His Ser Thr Gly Leu Arg Ser Leu Gly 

Met Thr He Thr Gin Cys Tyr Glu Glu Val Gly Leu Leu Leu Leu Phe 

340 345 350 

Leu Ser Val Gly He Ser He Phe Ser Thr Val Glu Tyr Phe Ala Glu 

355 360 
Gin Ser He Pro Asp Thr Thr Phe Thr Ser Val Pro Cys Ala Trp Trp 

370 375 380 

Trp Ala Thr Thr Ser Met Thr Thr val Gly Tyr Gly Asp He Arg Pro 

■>oc 390 395 

£1 Thr Thr Thr Gly Lys He Val Ala Phe Met Cys He Leu Ser Gly 

405 410 
lie Leu Val Leu Ala Leu Pro lie Ala lie lie Asn Asp Arg Phe Ser 

15 



195 2 00 
Ala Arg lie Phe Gly Val lie Ser lie lie Phe Val Val Val Ser lie 

215 220 
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420 






Ala 


Cys Tyr Phe 


Thr Leu Lys 


Leu 




435 




440 


Glu 


Ala Leu Lys 


Lys Leu Thr 


Lys 




450 


455 




Ser 


Val Asn Leu 


Arg Asp Val 


Tyr 


465 




470 


Ala 


Arg 


Leu Lys Gly 


Arg Glu Arg 




485 




Asp 


Phe Trp 







425 430 
Lys Glu Ala Ala Val Arg Gin Arg 
445 

Asn lie Ala Thr Asp Ser Tyr He 
460 

Ala Arg Ser He Met Glu Met Leu 
475 480 
Ser Thr Arg Ser Ser Gly Gly Asp 
490 495 



<210> 9 
<2H> 3424 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<222> (257) . . . (2195) 
<223> K+Hnovl2 

<400> 9 

ctcttctcca tgtccccaag gcccttctca gtccctcaga acattgccca ggcccctcct 
aqgttctgta aatgtccccc agactccttc ccatctcttt agttcttcct cctggttcct 
cttggcctct ctagacaccc ccagtttcct tgtttgggtg gctcaaggtg tctccaagcc 
cccaccatcc tggagacagc cacattctcc taaacgccac cctcactaag tctccctggg 
cttggggagt ggcacg atg gcg gca ggc ctg gcc acg tgg ctg cct ttt get 
Met Ala Ala Gly Leu Ala Thr Trp Leu Pro Phe Ala 
1 5 10 

egg gca gca gca gtg ggc tgg ctg ccc ccg gcc cag caa ccc ctg ccc 
Arq Ala Ala Ala Val Gly Trp Leu Pro Pro Ala Gin Gin Pro Leu Pro 
15 20 25 

ccg gca ccg ggg gtg aag gca tct cga gga gat grg gtt ctg gtg gtg 
Pro Ala Pro Gly Val Lys Ala Ser Arg Gly Asp Xaa Val Leu Val Val 
30 35 40 

aac gtg age gga egg cgc ttt gag act tgg aag aat acg ctg gac cgc 
Asn Val Ser Gly Arg Arg Phe Glu Thr Trp Lys Asn Thr Leu Asp Arg 
45 50 55 60 

tac cca gac acc ttg ctg ggc age teg gag aag gaa ttc ttc tac gat 
Tvr Pro Asp Thr Leu Leu Gly Ser Ser Glu Lys Glu Phe Phe Tyr Asp 
65 70 75 

get gac tea ggc gag tac ttc ttc gat cgc gac cct gac atg ttc cgc 
Ala Asp Ser Gly Glu Tyr Phe Phe Asp Arg Asp Pro Asp Met Phe Arg 
80 85 90 

cat gtg ctg aac ttc tac cga acg ggg egg ctg cat tgc cca egg cag 
His Val Leu Asn Phe Tyr Arg Thr Gly Arg Leu His Cys Pro Arg Gin 
95 100 105 

gag tgc ate cag gcc ttc gac gaa gag ctg get ttc tac ggc ctg gtt 
Glu cys He Gin Ala Phe Asp Glu Glu Leu Ala Phe Tyr Gly Leu Val 
110 H5 120 

ccc gag eta gtc ggt gac tgc tgc ctt gaa gag tat egg gac cga aag 
Pro Glu Leu Val Gly Asp Cys Cys Leu Glu Glu Tyr Arg Asp Arg Lys 

16 



60 
120 
180 
240 
292 



340 



388 



436 



484 



532 



580 



628 



676 
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130 135 "° 

125 130 



5 S 2 2 S 2 2 S H 2 SS S E S = E 

145 150 

I!? S 5 = K 2 2 E 5 2 2 2 2222 

160 165 

5 S E 2 S 2 2 2 2 2 2 E E 2 2 2 

175 180 

~ n-r ttc ate crcc gtg teg gtc ate gee aat gtg 
£ £ S S 5 £ £ S La Sa! sef VaX Xle Ma Asn VaX 
190 195 

SBSSSSS5SS5SSSSS 

205 210 215 

= = 2 S E 2 2 2 2 S 2 2 2 = 2 = 

225 230 
- ttc aca qqt gaa tac etc ctg egg ctg ttt gee 

S S VaX Leu S £ S S, Glu Tyr Leu Leu Leu Phe Ma 
240 245 

E2 S 222 2 2 2 2 2 2 2 2 2 2 

255 260 

S S E 2 2 2 2 2 2 S 2 2 22 2 2 

270 2/b 

-E222E22S2222222 

285 290 

-E22222E22222222 

305 310 

-S2222E2E2222222 

320 325 

= 2 2 E 2 2 2 2 E 2 2 2 2 2 2 S 
£SS2E222222E2E222 

350 355 

-^22222222222222 



724 



772 



820 



868 



916 



964 



1012 



1060 



1108 



1156 



1204 



1252 



1300 



1348 



1396 



365 370 375 

17 
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££EEE2£2E2E2222E 

385 390 

S 2 S S E s 2 s = s s a = = s = 

400 405 

SSS====SS5SSSK=S 

415 420 

5! S S = S S S S 2 S 5 = 5 = = = 

430 435 

E s 2 = 2 S 2 = s 2 = a 5 = s s 

445 450 

% S C2SS5S5=S2E 2 E = 

465 470 

caa cat cac cac ttg ctg cac tgt eta gag aag aca acg go cat gag 
Gin His His His Leu Leu His Cys Leu Glu Lys Tnr Tnr y^ 
480 485 

S S 2 S 2 S £ 2 = E 2 E 2 S = 2 

495 500 

E 2 2 2 S S S 2 2 S 2 2 2 2 S 2 

510 515 

„ rta tct tct tgc tgc cct cgc agg gec aag cgc cgc 
ccc gga age ctg ctg tct tec tg^ y Arg 
Pro Gly Ser Leu Leu Ser Ser Cys Cys Pro Arg Arg ju. y ^ 

525 530 535 

E222E222E2222E22 

545 550 

=HS52SES=5S=SK== 

560 565 

S 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 

575 580 

S 2 2 2 2 2 S S S S S S 222 2 

590 595 

«« «. »c jee =c. j, g. = - - - - - J E S 
Pro Ala Asn Thr Pro Asp Glu Ser Gin m o ^ 



1444 



1492 



1540 



1588 



1636 



1684 



1732 



1780 



1828 



1876 



1924 



1972 



2020 



2068 



2116 



605 
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S5K5SS=S====SSSS 

625 630 

etc ttc ccc gag act gtc aag ate tea tee c tgtgaggggt aggectgetg 
Leu Phe Pro Glu Thr Val Lys He Ser Ser 



2164 



2215 



640 * 45 



tcatgaaatg gtccacatag tggggaagta 9ccaggaaat 9*9 gcctccttgc 2455 

,„»,.an-ftt cctaatggga gctgaagcac tgggcttcca caggcccctg g>-v-<- a 
Sacac Iccactctcc cagctggact cctgcatgct cctccccttg 2515 

ctqggccccc agcaaactct agcaatagca gctgctgccg tgtcattatg 

=^ttta ctacaqcatt tacatctgcc ctaatcagag gggccacctc taactcctcc 
fcctSctc ttctcctctg gtttgegtec ttcctgggtt gggctggagt ctggactggc 3055 
tSataaga gcctggcaac cagcaagagc tgggctgtat ttggagatca tgggotgatt 3115 
SKSt gggcalcagt ccagaagcat caggggctcc ggcctgggat gtttctgaac 3175 
tttgawtt ataggagaca ggaggaactt ctcctcctcc tcctccccta ^ 

^ctoacaS aScaggtt ccctgtS agcactgtcc ccatggagct ggtggctgac 3355 
SK 2225 tcaaLaaac ctgagaggag agatgaggaa aaaaaaaaaa 415 



aaaaaaaaa 



<210> 10 
<211> 646 
<212> PRT 
<213> H. sapiens 

<220> 

<221> VARIANT 

<222> (1)...(646) 

<223> Xaa = Any Amino Acid 



Met AlaMa oly Leu Ala Thr Trp Leu Pro Phe Ala Arg Ala Ala Ala 

Val Gly Trp Leu Pro Pro Ala Gin Gin Pro Leu Pro Pro Ala Pro Gly 

val Lys Ala Ser Arg Gly Asp Xaa III Leu Val Val Asn Val Ser Gly 

Arg Arg Phe Glu Thr Trp Lys Asn Thr Leu Asp Arg Tyr Pro Asp Thr 

cn 55 
Le « Su Gly Ser Ser Glu Lys Glu Phe Phe Tyr Asp Ala Asp Sar Gly 

Glu Tyr Phe Phe Asp Arg Asp Pro Asp Met Phe Arg His Val Leu Asn 

Phe Tyr Arg Thr Gly Arg Leu His C» Pro Arg Gin Glu Cys He Gin 

Ma Phe Asp "in Glu Leu Ala Phe Tyr Gly Leu Val Pro Glu Leu Val 

Gly Asp cys Cys Leu Glu Glu Jyr Arg Asp Arg Lys Lys Glu Asn Ala 
n 135 I 40 

O, aS Le» *la Glu Asp Glu Glu Ala Glu Gin Ala Gly Asp Gly Pro 



145 

19 
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,a a Leu Pro Ala Gly Ser Ser Leu 

Ota Asn Pro His TnT Ser Thr Ala 

Gly phe Phe He Ala Val Ser Val 
195 

Pro cys Arg Gly ser Ala Arg Arg 

Glu £ «- - «» s Phe phe 

Se Phe Thr Gly Glu Tyr Leu Leu 

Cys Arg Phe Leu Arg Ser val Met 
260 

.eu Pro Tyr Tyr He Gly Leu Leu 
Gly Ala Phe Val Thr Leu Arg Val 
Phe 2 Arg Hi, ser Gin Gly Leu 
Ser Cys Ala Ser Glu Leu Gly Phe 
lie lie He Phe Ala Thr Val Met 

340 

Lys Thr Asn Phe Thr Ser He Pro 

„et £ Thr Leu Gly Tyr Gly 

Gly £ He Phe Gly ser He Cys 

S Leu Pro Val Pro Val He Val 
405 

Gln Asn Gin Arg Ala Asp Lys Arg 

Ma Arg He Arg Leu Ala Lys Ser 

Tyr Lys £ Asn Gly Gly Leu Glu 

Ma £ cys val Arg Asn Arg Ser 

£ Leu His cys Leu Glu Lys Thr 
485 

Leu Thr Phe Ser Glu Ala Leu Gly 
500 

. ser Arg Ser Thr Ser Val Ser Ser 

ser ser cys cys Pro Arg Arg 

Ma Asn Ser Thr Ala Ser Val Ser 

Met Leu Ala Gly Leu Arg Arg Ser 

wu Asn Ala Lys Pro His Asp Ser 
580 

^ p he Val Ala Ala He He Ser 
» Asp Glu Ser Gin Pro Ser Ser 
ser £ i~ Arg Asn Ser Ser Leu 

625 

Thr Val Lys He Ser Ser 



Arg Gin Arg Leu Trp Arg Ala Phe 
,a £ val Phe Tyr Tyr Val Thr 
£ Ala Asn Val Val Glu Thr He 
ser ser Arg Glu G°in Pro cys Gly 

220 

cys Met Asp Thr Ala Cys Val Leu 

Arg Leu Phe Ala Ala Pro ser Arg 

Ser £ He Asp Val Val Ala He 

val Pro Lys Asn Asp Asp Val Ser 

Pbe Arg Val Phe Arg He Phe Lys 

Arg He Leu £ Tyr Thr Leu Lys 

^ Leu Phe Ser Leu Thr Met Ala 

Phe Tyr Ala Glu Lys Gly Thr Asn 

£ Ala Phe Trp Tyr Thr He Val 
3oi> 

„ 1 e er Thr He Ala 
Asp Met Val Pro ser 

S er Leu Ser Gly Val Leu val He 
Ser Asn £ Ser Arg He Tyr His 
„ £ Gin Gin Lys Val Arg Leu 
G ly Thr Thr Asn Ala Phe Leu Gin 

r 445 
M p Ser Gly ser Gly Glu Glu Gin 
460 

Ala Phe Glu Gin Gin His His His 

Thr Cys £ Glu Phe Thr As* Glu 

^a £ Ser Pro Gly Gly Arg Thr 

£ Pro val Gly Pro Gly Ser Leu 

Aia Lys Arg Arg Ala He Arg Leu 

M Gly Ser £ Gin Glu Leu Asp 
555 

Hi9 Ala Pro Gin ser Arg Ser Ser 

Leu Asp Leu Asn Cys Asp Ser Arg 

£ Pro Thr Pro Pro Ala Asn Thr 
605 

pro Gly Gly Gly Gly Arg Ala Gly 

Gly Thr Pro £ !*« *■ ™ £ 
635 
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645 

<210> 11 
<211> 1862 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<222> (383)... (1157) 
<223> K+Hnovl5 



=2= ss ™ r - rcS — 

ottttcaaga agcagaaagc "ctgcatac -gM^ at * accaggc tggg gctgaa 
gagctgacag cgcaggcgat 9«^«gcg at ctcaaagcca agaa gaacaa 

taaaggcgtg cttgtgtggt «<*£«°" aaa ata aac aga aga gaa aaa 
gctgaaatag catcttcaaa aa atg gag cg^ ^ ^ ^ ^ ^ Glu Lys 

1 5 10 

E £ S £ E S S = = = = = S £ 5 = 
S5=SS===S=== S ? SS - 

£EESe=5SS = S = £S5SS 

60 65 

75 80 

- ™« aaa ctt eta ttg ccc gaa ggg ttt cga gaa 

SSSSSSES-s — TO * S alu 
— TEEEEES £ S £ S 5 2 

Asn Gin Leu Leu Ala Gin Giu ax* UQ 
110 115 

SB=aSSSS=S==£S=S 

140 145 

£ = S£S=S£SSS = 5:S5 = 



60 
120 
180 
240 
300 
360 
412 



460 



508 



556 



604 



652 



700 



748 



796 



844 



892 
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160 



165 



170 



S S S 2 S = 2 2222 S 2 2 S S 

175 1BU 

-222222222222222 

190 19S> 

1S222222222222222 

205 210 

= 2 K 2 2 2 2 S E 2 2 2 S 2 2 2 

220 225 

S S = 2 = = = S S S = S S = 2 S 

cac age gat gca ctt cat ttt ate a agtaattacc tgtgtcacga 
His Ser Asp Ala Leu His Phe He 
255 

— jsss S22S SSS 222= 

aaataacatg cccagctagc tctgtactac agagee s tt cctac tgatc 

etaaeggtat gtaaattcta tegctaaaga tjepttect JJ^ aagactgaaa 
agactcttcc acctaaaatg aaaacagtaa «"ctata a gggtatttgc 
gettttgeta tttatttgtc cttaagctgt etttcaattc ^9 aaaata tttt 
fcaaaaagaa gcatgtacat tatctatcgt tcatttaagt aaatgg^ gagaaactga 
aagg ggctat taatatttaa "tccttttc tactatggc aa aacagagct a 

actggcaaaa ttaactacct ggagoaaaac ^gtgcag ttttccatag 
tagfgaaaca aaatgagatt gtaagaagac attaaagcta ttg^ ^ 
5S tagaSctg /ttgatgaca ataaaaagta aataaaagca 1337 

ctgctacctt caaaaaaaaa aaaaa 



988 



1036 



1084 



1132 



1177 



1237 
1297 
1357 
1417 
1477 
1537 
1597 
1657 
1717 
1777 



<210> 12 
<211> 258 
<212> PRT 
<213> H. sapiens 



net S. He - Arg Arg GXu Lys GXu Lys GXu Tyr «. -V 

Lys His Asn ser Leu Glu Asp Thr Asp Gin GXy I*. « J ^ ~ 
^ L eu Met Thr Leu Asn Val GXy GXy Tyr Leu Tyr XXe Thr GXn Lys 
61n Thr Leu Thr Lys Tyr Pro Z Thr Phe Leu GXu GXy XXe VaX Asa 
Gly Lys IXe Leu Cys Pro Phe Asp AXa Asp GXy His Tyr Phe XXe Asp 
£, ^ GXy Leu Leu 11 Arg His VaX Leu Asn Phe Leu Arg Asn GXy 
Glu Leu Leu Leu Pro GXu GXy Phe Arg GXu Asn GXn Leu Leu AXa GXn 
Glu .a GXU Phe Phe GXn Leu Lys GXy Leu AXa GXu GXu VaX Lys Ser 



22 
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Arg Trp 
130 
He Thr 
145 

Ala Pro 

Lys Ser 

lie He 

Val Leu 
210 
Phe Glu 
225 

Ser Leu 
Phe He 



Glu Lys 
Asp Asn 
Asp Phe 



Arg Leu 
180 
Gin Phe 
195 

Lys Glu 
Ala He 
Asp Cys 



Glu Gin 

His Asp 
150 
He Ser 
165 

Asp Gly 



Leu Thr 
135 

Arg Ser 
Lys He 
Phe Pro 



Lys Tyr 

Asp Asn 

Met Met 
230 
Ser Lys 
245 



Phe He 
200 
Thr Phe 
215 

Ala Leu 
Gly Ser 



Pro Arg 

Gin Gly 

Lys Ser 
170 
Glu Glu 
185 

Lys Ser 



Glu Thr 
140 
Leu Arg 
155 

Arg He 
Phe Ser 
Glu Asn 



Val Cys 

Lys Cys 

He Val 
250 



Thr Leu 
220 
Gly Phe 
235 

His Ser 



Thr Phe 

lie Phe 

Val Leu 

He Ser 
190 
Gly Thr 
205 

Glu Thr 



Leu Glu 

Cys Asn 
160 
Val Ser 
175 

Ser Asn 
Arg Leu 
Leu Lys 
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<221> VARIANT 

<222> (1)...(256) 

<223> Xaa = Any Amino Acid 



Met Se^Arg Pro Leu Xle Thr Arg Ser Pro Ma Ser Pro Leu Xaa Asn 
An Gly He Pro i Pro Ma Gin Leu Thr Lys Ser Asn Ala Pro Val 
His Xle Asp val Ci y Gly His Met Tyr Thr Ser Ser Leu Ala Thr Leu 
Thr Ly s Tyr Pro Glu ser Arg Xle Oly Ar 3 Leu Phe Asp Oly Thr Glu 
Pro S. val Leu Asp Ser Leu Lys am His Tyr Phe Xle Asp Arg Asp 
Sr Oln Met Phe Arg Sr Xle Leu Asn Phe Leu Arg Thr Ser Lys Leu 

^ ne Pro Asp 2p - ^ * ^ ~ £ °» ^ 

100 d™ Met Leu Leu Glu Met Glu Arg Trp Lys 

Lys Tyr Phe Gin Leu Gin Pro Met Leu Le ^ 
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val S Val Ala Pro Asp Ueu Gly Glu Arg Xle Tfcr Ueu Ser Gly Asp 
£ Ser Leu XXe Glu Glu Val Phe Pro Glu Xle Gly Asp Val Met Cys 
^ ser val Asn £ Gly Trp Asn His Asp Ser Thr His Val Xle Arg 
Phe _ Pr o Leu Asn Gly Tyr Cys His Leu Asn Ser Val Gin Val Leu Glu 
S Gin Arg Gly Phe Glu Xle Val Gly Ser Cys Gly Gly Gly 
val Z ser ser Gin Phe Ser Glu Tyr Val Leu Arg Arg Glu Leu Arg 
S Thr Pro Arg val 2 Ser Val Xle Arg Xle Lys Gin Glu Pro Leu 
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Glu vaX Gly Leu U« Leu Leu Tyr Leu Ser Val Gly XXe Ser XXe Phe 
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aht- aaa aaq qag gag aac gag ggc ctg gcc 
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cattgetgag ctgcctcttg tgcctctggc a-geccagg caccttatgg ttatggtgta l« 
aagagtatgc ccagcccctg aggggagaga ^catggga » gcacctttcc 1915 

eag^tttag aatcgttttt agagggtggt 3tgtctgaea ^atgect 9 ttttccaga 1975 
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ttagaacgat gtacactgat g"catctca taaagt gggagagggg 2455 
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cttggttcac agtcaagect <=cttgtttcc WW tgtctaagat attagcatgt 
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Phe ASP Gly Gin «o Leu Gly Asn Phe Arg Arg 

165 „ -i t»« ser Arg Val Phe Ser He Leu 

^ Asp Asn Pro Gly Tyr Ser Val Leu Ser Arg 

180 „ T i a TieThr Met Cys Leu Asn ser 

S.r n. m» v»i vu « ay •£ Ils ™ 
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<213> H. sapiens 

<Zl £t Arg Gly Leu Leu Ala Pro Gin Asn Thr Phe Leu Asp 
Met Pro Ala Met Ary « i x=» 

1 5 ok G1V Thr His Ser Asn Phe Val Leu Gly 

Or He Ala Thr Arg Phe Asp Gly Thr 3Q 

20 „ Val cys Ser Asp Gly Phe 

^ Ma Ser Gly Gly Ala Leu Pro Val Val Tyr Cy ^ 

3 5 . c 1™ Ala Glu Val Met Gin Arg Gly Cys 

cys Asp Leu Thr Gly Phe Ser Arg Ala ^ 
l Ss Ser Phe L.u Tyr Sy Pro Asp Thr Ser 6 lu L.u Val Arg Gin 
Gin He Arg Lys Ala Leu Asp Glu His Lys Glu Phe Lys Ala G^u Leu 
85 1 fm cvs Leu Leu Asp Val 

n. u» w w s « «» L,u rS 110 

„S»-« — 3""" 

I 65 /«« t \/q His Lvs Leu Asn Lys 

180 „ » 01. Tyr Ly «1 M. M. 
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405 0 Gin Ser Asp Asn Cys Ser Ser Ser Ser 

Ala Gly Gly Asn Ser Ser Gly Gin Ser Asp ^ 
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val Gly Phe Gly Asn Val Ser Ala Asn Thr Asp Thr Glu Lys He Phe 
S XXe Cys Thr Met 2 Xle Gly Ala Leu Met His Ala Val Val P h e 
Gly tan val Thr Ala Xle Xle Gin tag Met Tyr Ala tag tag Phe Leu 
^ His Ser tag Thr Arg Asp Gin tag Asp Tyr XXe tag XXe His tag 
Ile P.O Lys Pro Leu Lys Gin Arg Met Leu Glu Tyr Phe Gin Ala Thr 
^ S Val Asn Asn Gly XXe Asp Thr Thr Glu Leu Leu Gin Ser Leu 
£ ^ Glu Leu Arg AXa Asp Xle Ala Met His Leu His Lys Glu Val 
^ oin Leu Pro £ Phe Glu Ala Ala Ser Arg Gly Cys Leu Arg Ala 
Leu ser Leu AXa Leu Arg Pro Ala Phe Cys Thr Pro Gly Glu Tyr Leu 
Ile His Gin Gly Asp Ala Leu Gin Ala Leu Tyr Phe Val Cys Ser Gly 
Ser 2 Glu Val Leu Lys Sy GXy Thr VaX Leu AXa XXe Leu GXy Lys 
S ASP Leu XXe GXy Z Glu Leu Pro Arg Arg Glu Gin Val Val Lys 
^ tan Ala Asp III Lys GXy Leu Thr Tyr Cys Val Leu Gin Cys Leu 
Gln ,eu Ala GXy X-eu His Asp Ser Leu Ala Leu Tyr Pro GXu Phe AXa 
Pr0 Arg Phe Ser Arg Gly Leu Arg Gly Glu Leu Ser Tyr Asn Leu Gly 
AXa GXy GXy GXy Ser Ala Glu Val Asp Thr Ser Ser Leu Ser Gly Asp 
Zl Thr Leu Met Ser Z Leu Glu Glu Lys Glu Thr Asp Gly Glu Gin 
61y Pro Thr Val Ser Pro Ala Pro Ala Asp Glu Pro Ser Ser Pro Leu 
« ser Pro GXy Cys Thr Ser Ser Ser Ser Ala Ala Lys Leu Leu Ser 
Pro ^ tag Thr AXa Pro tag Pro Arg Leu GXy GXy tag GXy tag Pro 
Gly tag Ala Gly Ala Leu Lys Ala Glu Ala Gly Pro Ser Ala Pro Pro 
tag Ala Leu Glu Gly S Arg Leu Pro Pro Met Pro Trp Asn Val Pro 
pro asp Leu Ser Pro tag Val Val Asp Gly XXe Glu Asp Gly Cys Gly 
ser tap Gin III Lys Phe Ser Phe Arg Val Gly GXn Ser Gly Pro Glu 
^ ser Ser Ser Pro Ser Pro Gly Pro Glu Ser Gly Leu Leu Thr VaX 
Pro Ss Gly Pro ser Glu Ala tag Asn Thr tap Thr Leu Asp Lys Leu 
S Gin Ala val Thr S Leu Ser Glu Gin Val Leu Gin Met tag Glu 
Gly ^u Gin Ser Leu tag Gin Ala Val Gin Leu Val Leu Ala Pro His 
^ Giu Gly fro Cys Pro tag Ala Ser Gly Glu Gly Pro Cys Pro Ala 
Ser Thr Ser Gly Leu Leu Gin Pro Leu Cys Val tap Thr Gly Ala Ser 
S er Sr Cys Leu Gin Pro Pro Ala Gly Ser Val Leu Ser GXy Thr Trp 
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l G iu S Ser Thr Pro Ala ^ Pro Pro Pro S.Glu Glu Gly A. 
„ Gly Pro Ala Glu Pro 5 Val Ser Gin Ala.u Ala Thr Ser hr 
o G lv ser Gly Gly Le« Ala Leu Pro Trp Asp Pro 
Gly Glu Pro Pro Pro Gly Ser Gly * y 1055 

, M He Gly cys His Gly Ser Gly Thr Val 
His Ser Leu Glu Met val Leu He Gly Y 1Q70 



GlnTrpThrrGluGluGlyThrOlyVal 
1075 XU ° 



<210> 21 
<2H> 18°° 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<222> (346) ... U057) 
<223> K+Hnov28, splice 



<400> 21 ^ n ttcctccttc tcttaagtac agtatagttc 60 

atttgaatga ctgggttact S£S«tt ctctatggta «J 

tttctctgaa aatcttcagt ^tagttc g a ctttgccctg tagtaggttt i 

acatgtagaa ggccctaggg ^ctttc ttcc cctctgttgg gtggaaaaag 240 

cagctgagca aggacgagta 9"tttctgg g » gtctgtt ttt ctcctcttga 
actttcttct ctattttcct agttatatat 9* ag at g gat aat gga 

agtttccctg aaacctgggc tcttgaagac gcatcactgg g Asn Gly 



405 



5 S K S 2 « = - s s i S = S S ® 

~* t-t-a aca cgt tac ccg gat tec 

s a 5 s s s s = s » - - - - «s s " 

25 

2 £ S S 2 S S S 5 S = = - i - s 

40 

caa ggc aat tac ttt att gat cga gat gga cot ctt ttc cga tat gtc 
Gin Gly Asn Tyr Phe He Asp Arg ab P g5 

ctc „ c I «, • - = c « - £ s Z S £ 5 

Leu Asn Phe Leu Arg Thr Ser Glu be 8Q 
70 

^ oat ttt tac cag att gag ccc 

- ttt ff °Leu £ S 5 K E S Pne Tyr Gin Xle Glu Pro 
Glu Phe Asp Leu Leu Arg ny» 95 iuu 



501 



549 



597 



645 



85 90 
° 38 
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741 



789 



asasssssssssssss 
sscsasssssasssss 

120 

- = = s s s s s = = = = s = s s 

135 tgg aat 837 

C S = = = = 5 = = = S 5 S S *" " 

155 

15 t ttt gga 885 

S = S 2 S = S = S = S = E t - S 

ssss===s== BSS= 5 s 

^ 8 ^ 981 
200 t 102 9 

sssa===== oa =5 5Si: 

215 10 77 
o eta gcc cgg aag aca gac gac t gatctccgac cctgccacag 

230 rt-ttttttta aatcacagtg UJJ 

« r - s s-s ESS -S5S SS 
2SSS 53SS =ss — ^ SSS =2E - 

gcctgttcag agtctcoaga "cctttttt * aaaatgattt <*gataaaat 

Uatctac cott-cjj gcttttctta ^ t « 

S£S t S-aafca "Salgta c = 

tlagtagtat tgtaatatta «ggaaaact f tata aatcaa 99»«»£ l677 

.cittttt acagttatga «»«gtttc « 9^ tgaattctga atacaaaatg 
atctgtagcc atggaaatgt cgactagaa a ^ ataattC cca 

tccctgtggt £SSi aaatgaaata tgaaaaaaaa aaaaaa i8oo 



<210> 22 
<211> I 836 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<2 22> (382)... U093) 
<223> K+Hnov2B splice 2 



39 
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60 
120 



cctttagatg tcgctgaact agtt-ctat t tccctgaaac ctgggctctt 

tcaaqaagag caaaactcaa tacaaggc* tat atg atg 

Sgfcgca? cactggagca g «g Jj £ « y ^ ^ Gly Tyr Met Met 

S 5 = S ==SC*S = = S = ! S S 

30 J tfc 55 

» - 5 « £ S S S S S = E £ S - - 

Gly Gly Asp Pne 5Q so 

- - B- s £ s a s ?s s a s a s Hi - 

Lvs Glu Ala Asp P&e ^Y 1 1Q0 
95 

i- a rt ttt qaa gaa gtt gtg gag 

S S 5 S = S = = 3 S "* ~ "* S M ° 1 " 

=sss5S=S 3S== = aE= 

125 

.« .» - - - - s s s s c s c . s £ s s 

ii. u« tm <n» "» °» fit 150 

Sssasssssssssssss 

175 

C S = S = S = = 3 E S S = = = S 

190 

- — SSS S S S 2 E s S 5 E 

Pbe Thr He Arg Asn Thr Arg v 
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- 

- - - ?! IS S = 5S £ 3 S S S S S 

Glu Asn Thr Val Glu His Asn in? 23Q 

220. 1133 
aC a 9 ac gac t gatctccgac cctgccacag g—ggaa agactctcca 
Thr Asp Asp 

Z~— —■—*■ — S2 =225 2SSS ssss « 
£25 £2= ™ ESk SSE= ™ 

jsss ssss ibis ss: s= 

S=S SS3S 3S= IS; 2*K =£S 
S2 «~— --*■— - 



<210> 23 
<211> 1751 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<222> (297) ... U008) 
<223> K+Hnov28 splice 3 



<400> 23 ♦.f-aaaaattt gctctgcagt ttgctttaca 

gc.tgtttct «-W« g2X£ JracWtc gtMf 

ggetg.cttg gg«ttg~9t a«tg>M9 | ^, t , „ t tcc«tt aje.tg.it. 
S»4tttgt et»« 3 «t ««>■ otg.ttg.« 

SEE t™ SLSS JU- — — ^ S 

sss==s=s===== SS!;: 

20 25 

= C = = S S S S S S S 5 = = = K 

50 55 

s s s = = - = = ;s = E = = = ^ " 



60 
120 
180 
240 
299 



347 



395 



443 



491 



70 

41 
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s s s = s s = = = = s K s £ ~ = 

S S £ 2 S = 5 = = S = = S = S S 

100 105 

assscseasasssssa 

115 120 

S - £ £ = = B S S = S 2 = = = S 

130 135 

s £ s s = = = = = s s = - s s = 

5 S £ 5 = = = = I = 5 S = S = £ 

165 170 
act ttt gga ccc tgt gat tat cac cag gaa gtt tot ett agg gtc cac 
Thr Phe Gly Pro Cys Asp Tyr His Gin * „„ 
180 185 

,^rt- n-f aca ate cgc aac acc egg 

£ S K £ £ S £ S S £ = S =. — 

195 200 

s £ = a s s s = - = i s s s s: s 

210 g 

• - £ 3' £ £ K £ £ S S 5 ' 

Trp Thr Phe Cys Arg Leu Axa j 

* 230 2J;> 

cctg eeaeag gtteetggaa agactctcca ^a a gactgatt ttttttttta JJJJ 

aatcaeagtg tgagatattt tttttetttt aaatag g cc ttccC cctga 1198 

gaccagaagg aagttttgtg «*ttgg«9» ^etttttt ataaaaagaa gtctgaaaat 1258 

Itatgeatgt gcctgttcag agtctccaga taeetttt aaaatgattt 1318 

eattatggta tataatetac ccttaacaga S^tttctta w ^9 taagc 1378 

ctgataalat ggtccctaac tcaactagaa 99«»a»t acag s_ ctagtttcca ^38 

agagtactca tgatgccttt ^aaaact fttfcL'tca tttaaaagta 1498 

aaccaataaa taagtagtat tgtaatatta * ttctat gca tataaatcaa 1558 

S££S £££' ■•£££ £«£» ™< t ££ i 
£££? £SSS == SSS — iS 



635 



683 



731 



779 



827 



875 



923 



971 



aaaaaaaaaa aaa 



<210> 24 
<211> 1B42 
<212> DNA 
<213> H. sapiens 

<220> 

42 
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<221> CDS 

<2 22> (88)... (799) 

<223> K+Hnov28, splice 4 



<40O> 24 aecaccgccg cgcatttccc tgaaacctgg ^0 

cgggcatctc eeggeeegge WfSg'S^t |ga gac tgg ggc tat atg 1M 
gltcttgaag acgcatcact ggagcag atg ga^ ^ ^ Mp ^ Qly ^ Me t 

1 5 

162 

s s s = s = = = s s i s s * " >,s - 

30 

45 

ate etc aac ttc tta aga 

s S s s s s s = = - « - - - w 

60 ^ tt 

S £ E S = = S = « £ s i £ S = L " 
SSE=C=S= sa S= SSS S 

90 »- a rf ttt qaa gaa gtt gtg 

110 

* tac tec aac cca gtg get 

125 

tc cat tec tta eta 

gtc ate ata acg caa eta aee ate aee act aag ^ ^ Ser ^ ^ 
Val He He Thr Gin Leu Thr lie t lSQ 

140 

„ faa aat aag cac atg atg gac 

155 

ss5S=S= sss a= sss S 

S E S = S S S S 2 = = - £ s S = 

190 

* ~ - - -SS5 S £ = S 55 S SI S 

Sy P^e Thr He Arg Asn Thr Arg ^ 215 



258 



306 



738 



205 

43 
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786 

-s==as==5S«5= ss 

220 839 

.„ ,.c - . — — ~° gttcctggaa ^— 
Lys Thr Asp Asp 

235 _ w^-^t-ttt B99 



235 tttttctttt B99 

gctaaaaat acaagaatga aagaataagc ^ caag 

Saacatcat gtagggtgac ctagtttcca act? 

aaggaaaact gttcoaatca tttaaaagta t atct 

Lfctgtttc tctctatgca f tacaa aatg tec, 

ctgactagaa atatttatat JS^ agt gtactg t eta. 

etctttatge <*ggtg«gt at aaaaaaa aaa aaa 
aactaataaa aaatgaaata tga 



<210> 25 
<211> 237 
<212> PRT 
<213> H. sapiens 



mc ^"z v» « » « * ~ y * " p p " v " E « 
X«««. sr 

35 rt « Qiv Asn Tyr Phe He Asp "g Asp Gly Pro 

na AM Asp Pro Gin Gly Asn iy 

T S J v. — s. — ~ 81 ° - m " - 
« - X w , ao s. „ — s- - - w r ^ 

::: :g;:::3r.::.:2= 

" " 3 „„ ,„ » u . vi u. ... S - - ~ 

Thr Lys Trp Asn bys ^ 

195 • ser Glu Arg Ala Asn Glu Asn Thr Val Gi 

Arg val His Has Met Ser eiu a 22Q 

210 ^ Leu Arg Lys Thr Asp Asp 

Asn Trp Thr Phe Cys Arg Leu Ala ^ 



230 

225 

<210> 26 
<211> 3204 

44 
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<212> DNA 

<213> H. sapiens 



<220> 
<221> CDS 

<222> (182).-. (1349) 
<223> K+Hnov42 



"sF5 si si s s £ s = s - » - - -s - 

1 5 271 

aessssssssassass. 

20 32' 
. hat . aat qqg aaa ggt gga J * 

2 s = 2 s 2 E 2 2 a 2 - - - - « 

35 37 

- - - 2 S S 2 S - S 2 2 S 2 2 !» 

Leu He Asp Asp He JU.a «* 60 

50 55 4 - 

2 E 2 S 2 2 2 2 2 2 S 2 2 2 2 2 

65 70 

t-oo ctcr aca tta aat gtt gga 

S 2 2 2 2 2 2 2 222— - ~ « 

ggg egg C3C « - = « » £ S S S S K ^ 

100 

,« * .tg eg gec = £ £ - J J * 2 2 2 S 

Ser Met Leu Ala Hxs Met Phe uy v U5 
115 

_ . g « r s r s - £ 2 2 2 2 2 2 2 

Lys Gin Asp His Arg Gly Aia r 14Q 
130 135 

22 22 222 2 2 22 S = SB U 

145 

2222222222222222 
S 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 

180 

* cat tea cca ata tec ega aag gaa ttt gtc ega ttt 
cca ccg gag gat cat tea cc 



517 



565 



757 



805 
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997 



« ri* Ser Arg Lys Glu Phe Val Arg 
Pro Pro Glu Asp His Ser Pro lie Ser Arg y ^ 

195 

„ +na oaa eta cga tgc cag ggt ttg aac 
ttg eta gca act oca acc aag tea gaa ctg ^ £ ^ ^ Leu ^ 
Leu Leu Ala Thr Pro Thr Lys * ^ 
210 215 

S S S S 5 = S 5 = S £ S £ S = £ 

225 230 

- - ff £ £ S 5 S £ £ s - K S " 1 " 

Lys Met Ala Asn Leu Ser Arg ^ 255 

245 

s s £ £ £ £ £ S £ £ S £ « £ £ 5 

260 

~*r* hat tct aat gca gaa gga gca 

s £ £ £ c = s s £ £ - - % ~ « - 

275 

- - - - £ £ S S S £ £ S S S K £ 

Ser Leu Lys Leu Cys Asn Phe Giu P ^ 
290 

-ESE = 5SSSaSS = S = = S 

305 310 

SS S=SSSS=sS£ES=S 
= 5===BSS==SSS=== 

340 

55S=SSS====== S5= 

355 360 

- - sr s S £ £ S £ £ £ £ £ £ »' 

Asn val Lys Gly Ala He Phe w. 3gQ 



853 



901 



1045 



1093 



1237 



1285 



1333 



375 

370 



«. . m *. » . — ■ — — "*" C9 "" 
Ser Gin Ser Val Arg 

rtat 1449 

w « e =c« j-jg «S5 £££* »- 

SS5SS ££g «U£ -ss 2S5S 
2352 SSSS = 35b =£5 £££» H 

^gaattcc tagatgeagt atggatattt aaattgt tacctg tcac 1869 

aaggttgttc aggtttataa atagctttag tga g ttcaggttca "tttataat 1929 
SSEC eS?- a tt Lag t aa t a t ccag tt c «ac tg a ttg 
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^-s ssus sss ™ EH 

ttagcctagt "tctcatac tcaaa t aC ca aatagtttac ^9 2349 

accaaggtgc accauw „„ art;ia etac tgcaatcaaa gcttatatga y * 2 409 
cttagaaata gatactaaat t Lttgcagc tagaacccta 

tagagttgcc tgctaaaagg «^gt caaaa aa atagacgt ^aaataaag 

tg aggaattt t^acacaca tttJJ^ t t g a tt ctttta^ ^ 

gcatttggct 3g«caag 9 aa tttta gccca ftttctcga g 2649 

tttttttgtg gtttttcw tccctaccaa tgacttggtg cct 2709 

ctctgcagca goagaggaag 39^gtacc ^ tctgaattc t acagac«at 2 

ctaccccaag agcagggata ttagctg g tc ctttggg aga areata 

caacatgagg caaggaatca "gaaaa atatg caaagc tttc ="aacagc 

SSStct attttcataa ****** ££££ ^ 069 

aaaaaaaaaa aaaaa 



<210> 27 
<2H> 389 
<212> PRT 
<213> H. sapiens 



« „-= s. - - * - - y - - * : s r 

t «. - « - * «. « - « - - y Ly * w 

7-"; 

I yl— - 7- — HE" 

85 ™. »™ <?er Thr Leu Val Asn Lys Glu Pro 

Gly Arg Tyr Phe Thr Thr Thr Arg Ser Thr 

100 a. ov,^ tvs asp Lys Gly val Trp Gly Asn 

" 5 „ 0!V M. "« OT M S " 

L y Gin MP His Arg <Hy "| 

«. S » » — S - «"* g ~ - IU 5. 

s- I 

L ow s - - - a - ws olu s val *" 
I » s s , - » — . = - - - s 010 sly " 

s « u. « - s « - - s - - - - s r 
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270 

260 , r » Met Leu Cys Ser Asn Ala Glu Gly Ala 

Ala Asn Leu Gin Gly Val Lys Met Leu cy ^ 

275 p. Phe Glu Asp Pro Ser Gly Leu Lys Ala Asn 
Ser Leu Lys Leu Cys Asn Phe Glu P ^ 

„» S -r - - - 5 « v>1 " p S 01 " 81v « 

ASn ^ 3 40 rtn Glu Ala Asn Leu Arg Gly Ser 

Asn Val Lys Gly Ala He Pne 38Q 



3 75 

370 

Ser Gin Ser Val Arg 
385 

<210> 28 
<211> 1246 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<222> (432)... (1092) 
<223> K + Hnov44, splice 1 



ids ss= =ss sr5rs.-sr=Ss*=2 

accgagccgt g atg etg « «J ^ Met Met Gly Phe Ser Val Leu 



5 



15 

. „ _ ac aca gat ate atg gac 

30 

55S5SESSSSa=S=S= 

sss=ss=a=a== s: 5=* 

65 

2 S E = S S S = E = S = = = = S 
^,1 — - — — - — 



60 
120 
180 
240 
300 
360 
420 
470 
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95 ttc ttc gat cac aaa aat gga act 806 

S S = 5 = S * = E 5 = = E S S S 

QQ2 

4. cac tqt tta ttt tgg ™* 

145 ~ 950 

S £ 2 S 2 = S S S 2 £ S S = ;s S 

I 60 tc 998 

£S££2222S££iS£S« 

175 ttc 1046 

• - ? C S Z 2 S 2 2 £ S 2 2 - - 

Arg Asp Glu Val Gly Giy 200 

190 tct t 1092 

- - - £ 2 2 2 S 2 S 2 S S 2 s 

Lys Leu Cys He Met Arg 2lg 

210 1152 

— -SSS 2SS W S5S3 =t 12 
3S2 222. ■ - 

<210> 29 
<2H> l^ 1 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<2 22> (297)..- (957 
<223> K+Hiiov44 f splxce 2 

— ~ 35= 5SS 2S2J 35S 3 
£ S S S S » S 2 2 2 2 S 2 2 2 = 

ss=sssssa=ss=5== 



347 



20 

49 
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r<at ate atq gac gac tgg ctg gac 
tcg acc tgc act gee ate eae aea gat ate atg 9 9^ ^ Leu Mp 
Ser Thr Cys Thr Ala He His Thr as P ^ 

35 4 

s s s s 5 s a = - = s = ' s s 'I 

„ cca aqt cag aaa get etc .1 ! 

* £ S S E 52 E S = S = - - - E M " 

^ y 70 

-sss==scas==ss=s 

85 

a ^ aat ttg etc aac agt get ctg 

100 

5 £ S E £ £ S = E = S S E = = S 
« S £ = E = = = = S S S S S S = 

130 135 

ht-at ttt tgg cct tea ctg act 

- - S S E 2 E E S E E 5 ~ - - - 

Tvr Asp Gin Met iu.<* ±± lg5 
* 150 

S S S S K = E S S S EE EE = = 

S E E 2 s = s E = E C S 5 s = s 

180 

- - r e s E s s a e a e e e s e 

Gly Gly hya val Pro Tyr u 2Q5 
195 

«ra aaq aaa tct t aagacggtgg 
atg agg agg age aaa gga aga gea gag ^ 
Met Arg Arg Ser Lys Gly Arg /u. ^ 
210 

SEES =1 EEIE SEES =SK = 

aaaaaaaaaa aaaa 



587 



635 



827 



875 



923 



CCtgtcty^« ~- - 

aaaaaaaaaa aaaaaaaaaa aaaa 

<210> 30 
<211> 220 
<212> PRT 
<213> H. sapiens 



<400> 30 _ Q _ r val Le u Met Phe Phe Leu 

„et Leu Gly Phe Ala Met Met Gly Phe Ser 
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» 5. - - - * - - s,r "* KO 01y 01 ° ly * " 

65 !? r1n Ma Val Gin He Asn Pro Lys Cys Phe 

Leu Leu His Tyr Asn 61u Glu Ala Vai „ 

85 , » Asn Asp Leu Leu Asn Ser Ala 

Tyr Thr Pro Lys Cys His Gin Asp Arg Asn Asp UQ 

1°° t- t«. a«m Glv Thr Pro Phe Ser 

180 nwoTvr lie Glu Gin His Gin Phe Lys Leu cys 
val Gly Gly Lys val Pro Tyr lie a 205 

Ile Met S - * £ *» ^ ^ S 
210 

<210> 31 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> consensus sequences 

<400> 31 
tatccacatc aatggacaaa gc 

<210> 32 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<400> 32 
tgcataactg gctgggtgta 

<210> 33 
<2U> 22 
<212> DNA 

<213> Artificial Sequence 

<400> 33 
tgacatcact ggatgaactt ga 

<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<400> 34 
tgcctgcaaa gtttgaacat 

51 
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<210> 35 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<400> 35 22 
tgacatcact ggatgaactt ga 

<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<400> 36 20 
tgcctgcaaa gtttgaacat 

<210> 37 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<400> 37 20 
acctggtggt atggaagcat 

<210> 38 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<400> 38 19 
tttctcctgg cctctaccc 

<210> 39 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<400> 39 19 
tccctcttgg gtgaccttc 

<210> 40 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<400> 40 20 
atctttgtca gccaccagct 

<210> 41 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<400> 41 24 
aggtgtgctg ccatctgctg ttcg 

<210> 42 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

52 
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24 

<400> 42 
agcctatcct ctctgagagt cagg 

<210> 43 
<2H> 21 
*212> DNA 

<213> Artificial Sequence 

21 

<400> 43 
aagcagagta 

<210> 44 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

20 

<400> 44 
tctggtagac agtacagtgg 

<210> 45 
<2U> 20 
<212> DNA 

<213> Artificial Sequence 

20 

<400> 45 
catttggctg gtccaagatg 

<210> 46 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

20 

<400> 46 
agtcattggt agggaggtac 

<210> 47 
<211> 20 



<212> DNA 

<213> Artificial Sequence 



20 

<400> 47 
catgcttcta cagtccagcc 

<210> 48 
<211> 20 

2£ Sificia! Seguence 

20 

<400> 48 
ggtcctcagt tgcagaaatc 

<210> 49 
<211> 45 
<212> DtNA 

<2 13> Artificial Sequence 

45 

<400> 49 aactatgggg acatg 



tggtgggctg 



tStgaccat gacaactgtg ggctatgggg 



<210> 50 
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<211> 45 
<212> DNA 

<213> Artificial Sequence 
<400> 50 



tggtgggcag'tggtcaccat gaccactgtg ggctacgggg acatg 



<210> 51 
<2U> 45 
<212> DNA 

<213> Artificial Sequence 
<400> 51 



tggtgggcag'tcgtctccat gacaactgta ggctatggag acatg 



<210> 52 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<400> 52 



tggtggg^tStaaccat gacaacagtg ggttacggcg atatg 



<210> 53 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<400> 53 



tggtgggctg%g 3 tcaccat gacgaccctg ggctatggag acatg 



<210> 54 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<400> 54 



tggtgggggg'tStcacagt caccaccatc ggctatgggg acaag 



<210> 55 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<400> 55 



tggtgggcag'tggtcaccat gaccacggtt ggctatgggg acatg 



<210> 56 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<400> 56 



tggtgggccg'tggtcaccat gacgaccctg ggctatggag acatg 



<210> 57 
<211> 45 
<212> DNA 

<213> Artificial Sequence 



<400> 57 



54 
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tggtgggctg 



tggtcaccat gacgacactg ggctacggag acatg 



<210> 58 
<2U> 45 
<212> DNA 

<213> Artificial Sequence 
tggtgg ; 4 c°trtS tg acca t 9 acaact g t g ^ acat g 

<210> 59 
<211> 47 
<212> DNA 

<213> Artificial Sequence 

^i*- ■ att999tat9 9CttcC9 

<210> 60 
<2H> 47 
<212> DNA 

<213> Artificial Sequence 

ttttt^U*" a.aaacoacc 

<210> 61 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
^cctc^ccUa.ac cca.acaacc ata gg ctat g ^ 

<210> 62 
<211> 47 
<212> DNA 

<213> Artificial Sequence 

tt a«^ > cS^ «^ atC9gCtatg " ttCC9 

<210> 63 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
^ct^ccc^aatc ccaaaccacc a ttgg cta tg .cttcc. 

<210> 64 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
^ctc^rc^aatc cca.acaacc .a.tcc. 

<210> 65 
<211> 47 
<212> DNA 

55 
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<213> Artificial Sequence 
ttcctt^rccLtgaggt ccaagtgact attggctttg gggggcg 



<210> 66 
<211> 47 
<212> DNA 

<213> Artificial Sequence 



tttctcrt^cattgaagt tcaagttacc attgggtttg gagggag 



<210> 67 
<211> 50 
<212> DNA 

<2l3> Artificial Sequence 



gcgctctact 



^tcLcttcag cagcctcacc agtgtgggct tcggcaacgt 



<210> 68 
<2H> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus sequences 



<400> 68 val 6ly Tyr Gly Asp Met 

Trp Trp Ala Val Val Ser Met Tnr ^ l5 



1 5 

<210> 69 
<211> 15 
<212> PRT 

<213> Artificial Sequence 



^ r££ Val VaX Thr Met Thr Thr Jeu Gly Tyr Cly - ~ 



1 5 



<210> 70 
<211> 15 
<212> PRT 

c213> Artificial Sequence 



Trp «?S vai Val Thr val Thr Thr Ue Gly Tyr Gly Asp y 



1 5 



<210> 71 
<211> 15 
<212> PRT 

<213> Artificial Sequence 



<400> 71 , ^ upt Thr Thr val Gly Tyr Gly Asp Met 
Trp Trp Ala Val val Thr Met Thr Tnr ^ i5 

1 5 



<210> 72 



56 
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<2U> 15 
<212> PRT 

<213> Artificial Sequence 



». JZ ~ n. «. « - ~ y » ay Ih * Mr S y 



1 



5 



<210> 73 
<2U> 15 
<212> PRT 

<213> Artificial Sequence 



* «T£ S. - « « - - ? Ile 81y ** ay » 

1 5 



<210> 74 
<211> 15 
<212> PRT 

<213> Artificial Sequence 



<4 °p£ Ser He Glu Thr Glu Thr Thr He Gly Tyr Gly Tyr 
Phe Leu Phe ser aa« « 10 



5 



<210> 75 
<2H> 15 
<212> PRT 

<213> Artificial Sequence 



« - - « - - - IT n * 81r ** °' y » 



i s 



<210> 76 
<2H> 15 
<212> PRT 

<213> Artificial Sequence 



PM - « - »■ «* " oly ? 



<210> 77 
<2U> 15 
<212> PRT 

<213> Artificial Sequence 



<400> 77 ^ ne Gly Tyr Gly Phe 

Phe Leu Phe Ser Leu Glu Ser Gin Thr Thr x5 



1 5 

<210> 78 
<211> 15 
<212> PRT 

<213> Artificial Sequence 



<4 °phe Ser He Glu Thr Glu Thr Thr He Gly Tyr Gly Phe 
Phe Leu Phe Ser ue « 1Q ^ 



5 

57 
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<210> 79 
<211> 16 
<212> PRT 

<213> Artificial Sequence 



<400> 79 ^ q e r Ser Leu Thr Ser Val Oly Phe Gly Asn 

Ala Leu Tyr Phe Thr Phe Ser Ser Leu ^ lg 



1 



5 

<210> 80 
<211> 2571 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<222> (110) ... U285) 



z s a = s E K a = = s e s a = a 
a s s a a s e s a s s s = = e s 

25 

20 

= s s s a s s a s a = a 2 = s - 

40 

s a a e e = s a s s a a s s s a 

55 

nta ctt qct aac get teg ggg tec 
gcg gec gga fg ctg ggg egg gte gtg ctt ^ ^ ^ ^ Qly Ser 

Ala Ala Gly Arg Leu Gly Arg vax gQ 
70 

s = e a s s e a a a e s e a s a 

85 90 

-„ tat aaq tac aca acg cca ctg 

e = a a a sssss5v**» - 

ioo 105 

s a e s a e s a s s a a a = s a 

2SSSS=S=S= S = S S =S 



60 
118 



166 



214 



262 



310 



358 



406 



454 



502 



550 



135 140 
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-2S=a=s=s==ssss;s 

150 155 

« >r~r. rac ttci qtq gee ctg ttg ggg gtc 

5 S S S H S 5 S S 2 « E - - « « 

165 

n<- a oto ccq qct gtg ate ttt gec cac etc 

C S S S 2 - 2 O 2 E W n, p to «. - 

s s e s = s s s s s s s s s s = 

200 2Ub 

- - S S S 2 5 5 2 a S 5 a s s s 

Leu Ser Thr He Gly Leu t*iy »*v r 225 
215 220 

- - 2 SS SS3S2SS s 2 s s 

Gin Pro Tyr Arg Ala i*eu xyi ^y* 240 
230 235> 

2 S 2 SI E 2 S 2 S 2 2=25 = « 

245 250 

sss=s=ss= s s ==s= s 

cct ™= « s s s s ~ 3 ?g £ S S S 

Pro Ala Ser Phe Asn Ala Asp Glu Asp asp a 29Q 
280 285 

c=c cag ccg gag tog cac cag caa etc tct gec age tec cac a^ gac 
Pro Gin Pro Glu Ser Hxs Gin Gin Leu Ser Ala ^ 
295 300 

- ff r s 2 s " 9 2 s a s 2 2 s 2 s 

Tyr Ala Ser lie Pro Arg * 320 

310 31b 

BSS 2 B 2 BSSSSSSSSS 

325 330 



agg aat gtc eae gag cac age agg tga tct tga ggc ett gee gtc cac 
£q Asn val His Glu His Ser Arg Ser wy ^ 
340 345 



598 



646 



694 



742 



790 



838 



886 



934 



cgt etc tec ttt gtt tec cag cat ctg get ggg atg tga agg gca gca 
£g Leu Ser Phe Val Ser Gin His Leu Ala Gly Met Arg 
355 360 

= £ S 2 S 2 2 E = 2SS = 2 = = 

370 375 
tte tct gtc ctt tct ctcatcctct ttacaetgtg tctctctggc tctctggcat 

CO 



982 



1030 



1078 



1126 



1174 



1222 



1270 



1325 
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Phe Ser Val Leu Ser 
385 



1385 
1445 
1505 
1565 



tctcgCt gcc tctgtctttc c„ t~ t„ 

ass ssss ss= ~ ssss ess 

cttgatttcc tcagctgcca aatgggaaga atagaagaat £9 actgC cctta 1625 

tgtgctggcc ctgtgctaga cagtgctgga 3acatagttg gC ccaaaatg 1685 

SSkSbo agtccagtga ggtggacaga cctgtcccca Lagggctga 1745 

gfcaggactt taatggagga ^aggtgt tgaaagca^ 9g ggacttccta 1805 

agtcggagaa gcatagggac taggcccaat ccagcc gg J» aagttgtaoc 1865 

gagSaggga catcgaacta agacctga^ tatgagaaat jjj^ ^ 1925 
tgactcattt ttctcaggtg tctccaggga 9 ** gaaacagaca taggacccca 

gcctgggcga tagactcttc ctcagcagcc tggcaggcag g 9^ gtaagagctg 
gcccagatct gaatggcatg 99^99^9 c ™£ gctcaaatcc caaottagco 

Iccacatttg tatgttgtgc cctggaatca gcctggt g * ccttatctgt 
acgtctggcc tgtgtccttg 99^9tcaca <g™£ g gc tgagtgt ggtggctcac 
aaaatggtga tcatcataat acaacttcaa aagg * cgcttgagg ccaggagttt 

gcctatacac ccagcacttt ggaaggctga gga gg sa acaaccacaa aatctaaaaa 2345 
lagactagcc taggcaacac agtgaggcct ^tctcaaca aca 2405 
X,ctgggt gtggtggtgc atgcctgtga tcctggac ttcaggg^ 9^^ 
9a TcaS SSS SSS Kaalgaga gaaagaaaga aagaaagaga 2525 
SSS gSaaga aagggaaaga tggaaggaag gaagga 



<210> 81 
<211> 388 
<212> PRT 
<:213> H. sapiens 



i » « - « ^ » - « ». « - - y - 

— y- — 
«. „ 3 4 » «. « - - - - - 7 ™ " G1 " 
Si «. «.'•!■ «» « - 0 » "° V M "* "* "~ » 

2, oi, «. »» s. s« «»s">»»" te Ba r "* 
^ PM „ i. s. - - s ~ « ■* * s ** * 

. „ » £ - «. « s - » - S ~ M " 

_ S u. - » ~ S. « » - = «. - - «- 3 
S civ «p »• - S - - « s « Bi "" ,sl S " 

» CIV VI V.1 S * V.1 „ £ - « »• - « « "* 
U . «. « £ ClU «. «P J. «- » «P «• -J 

n. U. » ^ n. CIV » civ « tvj v.i » civ ci. 
u . K CIV *. >. Wr S «• - tvr IV. Val - v^ T« v.1 



1925 
1985 
2045 
2105 
2165 
2225 
2285 



225 

60 
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n » 1a Met Val Leu Val Leu Gin Thr Phe 
qvr Leu Phe Leu Gly Leu Val Ala Met Val Le ^ 

245 «• B Glv Leu TnT Glu Leu lie Leu Leu Pro 
lra His val ser Asp Leu His Gly Leu 270 

260 PKe Asn £a Asp Glu Asp Asp Arg Val Asp 

Pro Pro Cys Pro Ala Ser Phe Asn Ala Asp 

2« „, JL r His Gin Gin Leu Ser Ala Ser ser 

He Leu Gly Pro Gin Pro Glu Ser His Gin ^ 

290 ™ Ala ser X Pro Arg Leu Gly Gin Pro Leu Pro Gly 
His Thr Asp Tyr Ala ser 

310 , rlv Ago Gin Ser Trp 

£ «r « - "» - « - " S u 2„ 

I I " s: - - - s = -* Gly " a "* "* T 

370 

Ser Val Leu Ser 
385 

<210> 82 
<211> 3300 
<212> DNA 
<213> H. sapiens 

<220> 
<221> CDS 

<222> (50)... (1285) 

-«*£r£— — " t3 "" M * = s E 

-5 = SSSS===== =S:SS 

s=s==s==a=s==a=5 

CSS s s s S S 3 s s c s s s = 

40 

, ~r*n a CC atq ttc aaa gca 

«SSSS=S==5==SSS= 

70 

- - s s s = = s = s e = = s s " 

Lys Gin Thr Phe lie ser v,^ 95 

s c s s s = s s = = s s s s s 



58 



106 
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100 HO 115 

cca tta gga aac acc tec aat caa ate agt cac tgg gat ttg gga agt 
Pro S Sy Asn Thr Ser Asn Gin He Ser His Trp Asp Leu Gly Ser 
120 I 25 

+nr t-tc ttc ttt get ggc act gtt att aca acc ata gga ttt gga aac 
£ Phe Pne Phe La Sy Thr Val He Thr Thr He Gly Phe Gly Asn 
135 140 



a i-n tea cca cgc aca gaa ggc ggc aaa ata ttc tgt ate ate tat gec 
Se Ser Pro £g Thr Glu Sly Gly Lys He Phe Cys lie lie Tyr Ala 
150 " 5 160 

tta ctq gga att ccc etc ttt ggt ttt etc ttg get gga gtt gga gat 
Leu S G?y He Pro Leu Phe Gly Phe Leu Leu Ala Gly Val Gly Asp 



165 



170 "5 



caa eta ggc acc ata ttt gga aaa gga att gee aaa gtg gaa gat acg 
G?n ly Thr lie Phe Gly Lys Gly He Ala Lys Val Glu Asp Thr 
1B0 "5 190 

att aaa tgg aat gtt agt cag acc aag att cgc ate ate tea aca 
Phe £ S 5 As* val Ser Gin Thr Lys lie Arg lie He Ser Thr 
200 205 

ata ttt ata eta ttt ggc tgt gta etc ttt gtg get ctg cct gcg 
S Se Phe lie Leu Phe G?y Cys Val Leu Phe Val Ala Leu Pro Ala 
215 220 

ate ata ttc aaa eac ata gaa gge tgg agt gee ctg gac gee att tat 
He He Phe Lys His He Glu Gly Trp Ser Ala Leu Asp Ala He Tyr 
230 235 240 



ttt gtg gtt ate act eta aca act att gga ttt ggt gac tac gtt gca 
Phe Val Val He Thr Leu Thr Thr He Gly Phe Gly Asp Tyr Val Ala 
245 250 2S5 

ggt gga tec gat att gaa tat ctg gac ttc tat aag cct gtc gtg tgg 
K ^ ^r Asp He Glu Tyr Leu Asp Phe Tyr Lys Pro Val Val Trp 



260 



265 270 



n-r toe ate ctt gta ggg ctt get tac ttt get get gtc ctg age atg 
Pne S He Zl Val Hy Leu Ala Tyr Phe Ala Ala Val Leu Ser Met 

280 285 



«« a aat taa etc caa gtg ata tct aaa aag aca aaa gaa gag gtg 
S 5 S S Zl Z S He ser Lys Lys Thr Lys Glu Glu Val 
295 300 

rtrra aaa ttc aaa gca cac get get gag tgg aca gee aac gtc aca gee 
E g!u Phe 5 Su His La Ala Glu Trp Thr Ala Asn Val Thr Ala 
310 315 320 

++r. aa a aaa ace acra aqq cga ctg agt gtg gag att tat gac aag 
C£SSSSSSl^-« Val Glu lie Tyr Asp Lys 
325 330 335 

ttc cag egg gee acc tec ate aag egg aag etc teg gca gaa ctg get 
Phe gS Arg Ala Thr Ser He Lys Arg Lys Leu Ser Ala Glu Leu Ala 
340 345 350 

62 



442 



490 



538 



586 



634 



682 



730 



778 



826 



874 



922 



970 



1018 



1066 



1114 
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1162 
1210 
1258 
1305 



s========s ss== s a 

z s s = s = = s s 2 s 2 = s s s 

375 

- * - - 2 S S 2 = = S 5 S S 3 °" S 

Glu ser He Tyr beu Asn Gly w« 400 

390 

aaa tag ccctctcttt aaataacctt 
att get gtg att gag aac ate aaa tag cc 
iS !L Vallle Glu Asn lie Lys 

aggc^cc ataggtgagg ac^ X 

tSLttgtg catgagctca aaaata^ tgatgaCgct tgttgaacgg 1485 
catcaagaga atttggaatt ctgagccagc & tgcctttttg tgcccagact 
tcoactttct ttgatgagtg gaacgaws „_ cctca g aatgaatgag aattgtttct 

gt'tcctct ctctttccct aatgtgccat aaggcetc^ tacc taactgagee 1665 

Lt-aacaatg tagctttgag ggatcagttc ataaatgaca agaaattctt i#« 

££5 -atttatgg atgacaacaa «tt g ttttt gta^ 5 

atgeagcett "acctaaga aattttctj JJ^ cctaggctC a ^Jg* "° 5 
tctctagcta atgtgtggtt tctcctu * t gga aacagtg tgtaaaatg* 

cttttacccc agttctccca "tgaatacc J*^ agat ggacat tgaagcaaca 1965 

tctatctatc tatctatcta tctatctatc "Wtatcta gtgt gaatta 2325 

g^tctgtg ggtatacttt tggagttgtg acttgg ^ | ttc atctctgaca 2625 
Latcactgt gaetttgeae atggaaaaat gcag a gtttgctgtg tttttttttc 26 
Sagagaaaa agctgtwta £££££ ttaatt?tlt atcctgtaat "tttggatg «« 
acataaaaga ggctgattat "tttttag ta aattgetace 2865 

qttccaagat tcagaaaaaa "cagtaaat gcac «3 attttta aaa tgcagaaaga 
SttwtLt tagatctget gtacattgta "tatatata gtctgggggt 
"ataatttc cctaaatata attgeaaact 3 a tttctttt acccagaatc 
gggagttSa tctgaataag tggcattcag «gg£ ^tttaaat ttaagctaga 
^aLagaa gcaaataaac taatagaege «•£ aagttaaa aa aaatccaacc 

2ES S£S 3SS -TSTc t-aattgt ^ 
£S5S S=3E " C a 2— aaaaaaaaaa aaaaaaaaaa 



2925 
2985 
3045 
3105 
3165 
3225 
3285 
3300 



aaaaaaaaaa aaaaa 



<210> 83 
<211> 411 
<212> PRT 
<213> H. sapiens 
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25 30 



Val Glu Ser Asp Thr Thr He Asn Val Met Lys Trp Lys Thr VaX Ser 

Thr tie pL Leu val VaX Val 2» Tyr Leu He lie Gly Ala Thr Val 

Phe 5s Ala Leu Clu Gin Pro His Glu Xle Ser Gin Arg Thr Thr lie 

Si Xle Gin Lys Gin Thr Phe Xle Ser Gin His Ser Cys Val Asn Ser 

Thr Clu Leu Asp Glu Leu Xle Gin Gin Xle Val Ala Ala Xle Asn Ala 

6ly Xle Xle Pro Leu Gly Asn Thr Ser Asn Gin Xle Ser His Trp Asp 

U» Gly ser Ser Phe Phe Phe Ala Gly Thr Val Xle Thr Thr Xle Gly 

Phe X Asn Xle Ser Pro Arg Thr Glu Gly Gly Lys Xle Phe Cys Xle 

£ Tyr Ala Leu Leu Gly Xle Pro Leu Phe Gly Phe Leu Leu Ala Gly 

val Gly Asp Gin £ Gly Thr Xle Phe Gly Lys Gly Xle Ala Lys Val 

Glu Asp Thr Phe Xle Lys Trp Asn Val ser Gin Thr Lys Xle Arg Xle 

Ile ser Thr Xle Xle Phe Xle Leu Phe Gly Cys Val Leu Phe Val Ala 

« £ Ala Xle Xle Phe Lys His Xle Glu Gly Trp Ser Ala Leu Asp 

S Xle Tyr Phe Val Val Xle Thr Leu Thr Thr Xle Gly Phe Gly Asp 

^ val Ala Gly Gly Ser Asp Xle Glu Tyr Leu Asp Phe Tyr Lys Pro 

Val val Trp 2 -p Xle Leu Val Gly Leu Ala Tyr Phe Ala Ala Val 

275 t i »i„ Tn> Leu Arg Val He Ser Lys Lys Thr Lys 
Leu ser Met He Gly Asp Trp Leu i*rg ^ 

290 u l^t wis Ala Ala Glu Trp Thr Ala Asn 

Glu Glu Val Gly Glu Phe Arg Ala His Ala A±a v ^ 

VaX Thr Ala Glu Phe Lys Glu Thr Arg Arg Arg Leu Ser Val Glu Xle 

^ Ly s Phe Gin Arg Ala Thr Ser Xle Lys Arg Lys Leu Ser Ala 

Ota Leu Ala Gly Asn His Asn Gin Glu Leu Thr Pro Cys Arg Arg Thr 

^ ser val Asn His Leu Thr Ser Glu Arg Asp Val Leu Pro Pro Leu 

U» S Thr Glu ser Xle Tyr Leu Asn Gly Leu Thr Pro His Cys Ala 



390 



Gly Glu Glu lie Ala Val He Glu Asn lie Lys 



405 410 

<210> 84 

<211> 20 

<212> DNA 

<213> H. sapiens 

<400> 84 20 
catagccata ggtgaggact 

<210> 85 
<211> 20 
<212> DNA 

64 
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<213> H. sapiens 

<400> 85 
gagaggaaaa cagtctgggc 

<210> 86 
<211> 20 
<212> DMA 
<213> H. sapiens 

<400> 86 
ggacatcgaa ctaagacctg 

<210> 87 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 87 
tcccatgcca ttcagatctg 



65 
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SEQ ID NO:10 tad K+Hnov protein of SEQ ID NO; 10 - ^ 4, „«, acid •eqaenca of SEQ ID 

oZp VI, cUi-(.)l-9. <«»wn .0 nucle* ^ SlS-*. to -id nucleic ec*U, 

NO.li. the nackic acid having the ^^^/^'^ «pro-i<» <^ P"*^ 8 

inn 1wf ^— ocmptMmg *"n nuclwo aoda. cell compnma g — in »»k<— ~~ 
K^UoTprotei. of SEQ ID NO.12 and K+Hnov protei. <f SI * ^ ^ ^ence of SEQ ID 

Group v£«Ui-WM. -taw. .0 aucWo ^nu^ac^ hazing to -id nuc.ee ac.de, 

NO.I4. the a«*c acid having the aequenc. of ae&oi & p^dag 

I i~. i- lt rr ~. eooixuiag -id nucleic aoda. cell compnamg ™J*^^ 

^Ivnrotei. of SEQ ID N014 and K+Hnov proteia of SEQ ID NO 14 

^ ^1— 1» udeie acidi eocodiag K+Hnov proten havmg the ammo nod K*Jueoce 01 
Oroop Vm. eh«(a)l-9. draw, to aacleto ^ ^~™» ^ ^bodiltog to -id aactoo aoda. 

NO.16. the having the "^J^f^^^-^^W- c«Z -ethod fcc producing 

r t at^ i^r^^of seq £ no,, :i n f^ 8 

I!!^irca»«o ooopwing -id nucleic acida. cell cooprwng expieaaK- ea-e». »e»oa v»~~ 
K+ZTpro-i. of SEQ ID NO 18 and ^^^^^ *. -ouk. ^ ^«ce of SEQ ID 
Ofovp X chi-(a)l-9. dmw» to iiuctoio actda "^? g J^^ acida hyWdaing «o aaid oudoie aoda, 

g^ rfB Qm»a»^^^^^^^^-P«.^"» 

0„»p M. «W-(.)W. » -Jf» ^^^^^-y^8«o-idn«cWc«id* 

I exDfesskm caaaeOe comprising sata d»«w - t i> urvi^ 

K^toTprote- of SEQ ID NO^J -^1^! Slaving the amino acid — queace of SEQ ID 
Group XH. cUaaWW. « J-* '-"*!■ JS^ST-* ihridixmg to -id nudeie acid^ 
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K+Hnov protein of SEQ ID N027 and 
Group XIII, claim(.)l-9. drawn to 
KOJO, the nucleic acid having the 



K+Hnov protein of SEQ ID NO30 and 
Group XIV, daim(i)l-9. diawa to 
NO:81, the nucleic acid having the 
expression cassette comprising laid 
K+Hnov protein of SEQ H> WWII and 
Group XV, elaim<e)I-0. drawn to m 
NO:83, the nucleic acid having the 
expression cassette comprising said 
K+Hnov potent of SEQ IDNO.83 and 
Group XVX claim(s)10, drawn to men* 
Group XVII. claim<s)Ii-14. drawn to 



K+Hnov protein of SEQ ID NO:27. 
acid, encoding K+Hnov protein having the amino acid sequence of SEQ IU 
rw* of SEO ID NO.28-29, nucleic acids hybridizing to said nucleic acuta, 
"da, cell comprising said expression cassette, method for producmg 

K+Hnov protein of SEQ ID NO30. rcvnm 
™b eSng K+Hnov protein having the ammo acd sequence of SEQ ID 
e of SEQ ID NO:80. nucleic acids hybridizing to said nucleic acids, 
sc^ «b comprising said expression cassette, method for producmg 
K+Hnov protein of SEQ ID NO:81. , wrim 
sTerS K+Hnov protein having the amino amd sequence of SEQ ID 

« of SEQ ID N082, nucleic acids hybnVUzmg to aaid nuck»eactd«, 
_ scids, cell comprising said expression cassette, method for producing 
K+Hnov protein of SEQ ID NO:83. 
sntmody that binds to K+Hnov. 

transgenic animal model for K+Hnov. 



„ i wit an not relate to a single inventive concept under PCT Rule 13.1 because, under 
The invention. Ustod as Group. ^^^^^^ to me foDowmg reasons: Group I is 

PCT Rule 13.2, they lack the ^^ t c ^^^'^ w of SE Q ID NO* nucleic acid, hybridizing to 
directed to nucleic acid (SEQ ID NO: I) said cuaette, method of producmg 

said nucleic acid, expression ^"^T* ^S^^SSkd feature i» the disclosed nucleic 
the K+Hnov of SEQ ID NO£ and the ^^f™* ^^to**** acid*, protein*, antibody and 
acid of SEQ ID NO: 1 -coding the ^tc^tZT^^^ the product, of 

transgenic annual -odd of Groups ^^^^^^^J ZTU the H+Nov proteins of SEQ ID 

»^iri4pi^h.yi»g«Bflci»ich n «c«»o--- — 
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I Prestations where ccrtaiu datmi were found unsearchable (Continuation of Item t of first »heet) 



Thit intend itporthuootto 

□ Claims Nosj 
hccmm Ihey relate to subject matter not required to be searcbed by this Authority, namely: 



2 ' fU becW^ relate to parts of the mteniatiooal application that do not comply with the prescribed retirements to such 
at extent that no meaningful mteraatioual search canto carried out, specifically: 



3 * ^ leX^are dependents 



ill 



where salty of invention Is lacking (Continuation of Item 2 of first 



This btemational Searching Authority found multiple mventkms in this mteraational application, i 
Please See Extra Sheet 



Q ^..^.ddi^ 

[I As sflseaicht** claims^ fee, mis Authority did ncrtmvitom 

ofsnyad^tkmaliee. 
PI As os^soaae of the retired addkkn^ 

1-1 only those claims for which tees were paid, specifically claims Noa.: 



4 n no (waiind additional search foes were timely paid by the applicant Consequently, this 
Li* nrtnetod to the invention first mentioned in the claims; it U covered by claims Nosj 
1-9, SEQ ID NO:l and 2 



Raniarkoai Protest [] The additional search fees were accompanied by the applicant's protest 
Q No protest accompanied the payment of additional search fees. 
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